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Each year, approximately one third of the world's crops are destroyed by plant pests, amounting to 
billions of dollars in crop losses in the united States alone. Plants are susceptible to diseases and damage 
caused by an enormous number of different types of organisms, including virus, bacteria, fungi, algae, 
parasitic plants, weeds, insects, arachnids, and nematodes. The potential losses are kept in check by 

5 natural controlling mechanisms, and when these systems fail, by applications of various types of insecti- 
cides which typically act by attaching one specific, genetically controlled aspect of the target organism's 
metabolism. However, the efficacy of any given pesticide may be limited by the appearance and spread of 
resistance to the pesticide among the target population. The appearance and spread of insecticide 
resistance in wild populations argues for a genetic origin. First, a resistant genotype or trait appears in a 

w local population and then with continued insecticide use (and thus, disproportionate survival of individuals 
with this genotype or trait), the resistance rapidly increases in the population and via migration resistance 
may spread to regional and perhaps even worldwide populations. Resistance may arise as a genetic allele 
already present within a population, or it may arise de novo . Nonetheless, whatever the cause, in a 
population with a short generation time (which is characteristic of many insects), the resistance trait can 

75 spread rapidly and quickly render ineffective the planned pattern of pesticide application. 

The continued development of natural strategies for insect control could be enhanced by an under- 
standing of the genetic basis of the resistance in economically important pests. Such studies have been 
ongoing, particularly with regard to insect pests, and a great deal has been learned about the major types of 
resistance observed in insects. At least three types of insect resistance have been identified: decreased rate 

20 of uptake, increased rate or degradation and changes in the target site. To some extent, certain aspects of 
the genetic mechanisms of these types of resistance have been determined; however, knowledge of the 
specific genetic basis for resistance has not yet been effectively applied in the field to monitor the 
occurrence of resistance, or to assist in planning effective insecticide applications to avoid or alleviate the 
development of resistance. Modification of insecticide application patterns can be critical in cases in which 

-25 resistant insects are otherwise less fit than non-resistant insects; -application -of insecticide-to -which some 
individuals are resistant in these cases may actually select for increase in resistance in the population, when 
it might otherwise have been maintained only at low levels or entirely eliminated from the population. Thus, 
a method for exploiting the available knowledge of the genetic basis for resistance is greatly needed. 

Some of the most destructive of insect pests are found among the order Lepidoptera. The damage 

30 caused by lepidopterans is most frequently related to feeding activity of their larvae (caterpillars) on plants. 
Of the lepidopteran plant pests, among the most damaging are those insects related to the genus Heliothis . 
Two species of the genus Heliothis , H. virescens (the tobacco budworm) and H. armigera (American 
bollworm), and Helicoverpa zea (the corn ear worm) are responsible for a tremendous amount of damage to 
tobacco, cotton, corn, beans, alfalfa, and solanaceous plants in the United States. Over the years these 

35 pests have been controlled by application of a variety of insecticides; however, HL virescens has regularly 
developed resistance to compounds from virtually every major insecticide class. As one exception, until 
fairly recently the pyrethyroid class of insecticides continued to effectively control Heliothis in the field. 
Unfortunately, it has recently been noted that pockets of tolerance or resistance are beginning to appear in 
Heliothis virescens populations in various areas in the United States and in H. armigera and H. punctigera 

40 abroad. Because pyrethroids represent the most effective control of these insects, it is essential that 
widespread occurrence and/or spread of resistance to pyrethroids be avoided. 

Resistance to pyrethroids has been extensively studied in a variety of dipterans, and a number of 
different patterns of inheritance and explanations for resistance have been suggested. However, the basis 
for pyrethroid resistance or tolerance in lepidopterans generally, and in Heliothis specifically, has not yet 

45 been clarified. An understanding of the genetic mechanism of resistance, or even a definable genetic 
marker for resistance, would provide a much-needed basis for tracking the resistance trait accurately in a 
population. The present invention now provides the necessary tools for monitoring the occurrence and 
spread of resistance in a population, in particular for pyrethroid resistance in lepidopteran populations. 

50 SUMMARY OF THE INVENTION 

The present invention provides an isolated nucleic acid fragment encoding all or a portion of a non- 
dipteran sodium channel. This channel is believed to be target site for sensitivity to a variety of different 
insecticides, including pyrethroids, and is useful as a marker for such target-insensitive insecticide 
55 resistance. Preferably the fragment encodes a lepidopteran, coleopteran or homopteran sodium channel. 
Sodium channels from both resistant and sensitive strains are encompassed herein. The nucleic acid 
fragment provides the basis for probes useful in detecting the presence of the resistance trait in a 
population of insects to be evaluated. Also provided are vectors containing the resistance gene which may 
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be used to introduce a gene encoding insecticide resistance into beneficial insects, such as honey bees. 
The invention also provides the isolated protein or fragment encoded thereby, as well as biologically or 
immunologically active fragments thereof, which protein or fragments are useful in generation of polyclonal 
and monoclonal antibodies. Such antibodies can be used to detect the presence of sensitive or insensitive 

5 sodium channels. In a preferred embodiment, the insecticide target is a Heliothis sodium channel. 

The invention also provides a means for monitoring, both quantitatively and qualitatively, the level of 
resistance in any given pesticide target population. The presence or absence of a resistance trait is 
determined by hybridizing whole genomic DNA, cDNA or one or more restriction fragments from one or 
more individuals from the population with a nucleic acid probe based on the sequence of a nucleic acid 

io encoding a pesticide target site. Quantification of the trait is further obtained by calculating the number of 
the individuals having resistance relative to the number of sensitive individuals, and calculating the 
percentage occurrence of resistance. This in turn permits the observer to determine whether or not the 
contemplated pesticide application will be effective, whether alternate treatment may be required, or to 
predict when, at some time in the future, alternate treatment may be needed. In an alternate embodiment, 

75 the DNA can be used to express a recombinant protein or peptide, which in turn can be used to raise 
monoclonal antisera. Preferably antisera that can specify or identify both resistant and sensitive targets are 
raised. Such monoclonal antibodies may then be utilized in routine immunological procedures to determine 
the presence or absence of the resistant protein in a population. 

The present invention also provides the basts for an in vitro screen which will detect potential 

20 insecticidal activity. A nucleic acid sequence encoding a lepidopteran sodium channel can be inserted into 
a convenient host cell and a battery of potential insecticides tested for their ability to interfere with 
expression of either the gene or the encoded protein. 

BRIEF DESCRIPTION OF THE FIGURES 

"25 ■ " ~~ " " 

Figure 1 illustrates the nucleotide and amino acid sequences of the Heliothis clone hscpt, in 
comparison with the nucleotide and amino acid sequence of the para locus (sodium channel) of Drosophila 
melanogaster . "Dm" = Drosophila sequence; "scd" = portions of the Heliothis sequence; the numbers 
after "scd" refer to various subclones used to determine the sequence. The underlined amino acid 

30 sequences are membrane-spanning domains of the sodium channel. Superimposed above the sequences 
are the specific sequences of various primers (e.g. HSC 3455 + ) used in cloning and/or sequencing 
procedures. Numbering is based on the Drosophila homologue sequence to the Heliothis sodium channel. 

Figure 2 shows Restriction Fragment Length Polymorphisms (RFLPs) developed utilizing a labelled 
hcspl DNA sequence as a probe. 11 RR" identifies DNA derived from resistant individuals and "SS" refers to 

35 DNA derived from sensitive individuals. The presence or absence or resistant and sensitive individuals is 
made by the vial test described by Campanhola and Plapp, J. Econ. Entomol. , 82:1577-1533, 1989. 
Protocols for the procedure are described in Example 3. 

DETAILED DESCRIPTION OF THE INVENTION 

40 

As described in detail in the following Examples, the Heliothis sodium channel is isolated by 
amplification of Heliothis genomic DNA from an inbred susceptible strain using degenerate primers 
homologous to a portion of a sodium channel gene from Drosophila melanogaster (Loughney et aL Cell 
58:1143-1154, 1989), as described in Example 2. A 184 bp amplification product is obtained which, upon 

45 sequencing, is found to encode an identical amino acid sequence when compared to the same region in the 
Drosophila gene. This PCR product is then labelled and hybridized to restriction enzyme-digested Heliothis 
genomic DNA. The highest molecular weight DNA fragment identified is from an EcoRI digest. 

Genomic DNA is then isolated from a resistant Heliothis strain and digested to completion with EcoRI. A 
genomic library is constructed in a g Zap II vector, and a labelled 184 bp fragment is then used to screen 

50 this library. One positive plaque yields a genomic clone of approximately 8000 bp which is referred to as 
"hscpl." This clone shows significant homology to the published Drosophila sequence (Figure 1). 

Based on the hspc 1 sequence, a pair of primers designated 41 16 + , and 4399- (as depicted in Figure 
1) are used to amplify fragments of the sodium channel gene from both resistant and susceptible Heliothis 
individuals. Fragments are digested with either Rsal, Sau3AI or Msel. The restriction fragments are then 

55 separated and analyzed by gel electrophoresis. The resulting Restriction Fragment Length Polymorphisms 
(RFLPs) show distinct patterns unique to resistant and susceptible individuals. This demonstrates the utility 
of a nucleic acid sequence for defining genetic RFLP patterns useful for identifying resistant individuals 
within a population (Figure 2). 
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By homology with the known nucleic acid sequence for a Drosophila sodium channel, it is presumed 
that the isolated Heliothis sequence represents a portion of the corresponding Heliothis channel. Also, by 
comparison with the available information regarding the Drosophila channel as being the target site of 
pyrethroid action, it is reasonable to extrapolate this function in Heliothis as well. However, whether or not 

5 the isolated sequence represents the target site, or a genetic locus that is tightly linked with resistance, the 
RFLP results described above show that difference in the DNA is a reliable marker for identifying 
differences in susceptibility to insecticides that primarily target the sodium channel, particularly pyrethroids 
(but also chlorinated hydrocarbons and venom components such as the toxin derived from Androctonus 
australis [Aalt], saxitoxin, tetrodotoxin and the like) in an insect population. 

70 The isolation of the DNA sequence encoding the Heliothis sodium channel provides a number of 
advantages. First, in view of the unexpected high level of homology between Drosophila and Heliothis 
sodium channels, it must be assumed that channels of other lepidopteran species have similar or even 
higher homology to the Heliothis sodium channel. Thus, the Heliothis sodium channel DNA provides the 
basis for isolation of other lepidopteran channels. Such lepidopteran channels can be readily isolated by 

15 hybridization under medium (e.g., 1xSSC, 0.1% SDS, 55°C) or high (0.1 x SSC, 0.1% SDS, 65°C) 
stringency conditions using the Heliothis DNA or portion thereof, to function as an identifiable probe when 
screened against cDNA or whole genomic libraries from the species of interest. Isolation of DNA hybridizing 
under said conditions can be achieved by standard techniques. Lepidopteran species of interest include, 
but are not limited to: other Heliothis species, such as the American bollworm, H. armigera and the 

20 bollworm, H. punctigera ; lepidopteran species of the genus Spodoptera , e.g., the Egyptian cotton leafworm, 
S. litteralis. the beet armyworm. S. exigua : the fall armywornrv S. frugiperda ; the cutworm, S. litura . the rice 
swarming caterpillar, S. mauritania and the Southerm armyworm, S. eridania ; and other miscellaneous 
lepidopterans, e.g., the pink bollworm. Pectinophora gossypiella ; the spiny bollworm, Earius insulana . the 
cotton leafworm, Alabama argillacea ; the leaf perforator, Bucculatrix thurberiella ; the tomato fruitworm, 
-25 Helicov e rp a zea ;- the diamondback moth, Plutella - xylostella ; the cabbage looper r Jnehoplasia ni;— tee- 
imported cabbageworm, Artogeia rapae ; the imported cabbageworms Hellula undalis and Hellula rogatalis ; 
the black cutworm, Agrotis ipsilon ; the corn earworm, Ostrinia nubalis ; the tomato pinworm, Keiferea 
iycopersicella ; the tomato hornworm, Manduca sexta and Manduca guinquemaculata ; the velvet bean 
caterpillar, Anticarsia gemmatalis ; the green oliveworm, Plathypena scabra ; the soybean looper, 

30 Pseudoplusia includens ; the saltmarsh caterpillar, Estigmene acrea ; the leaf miner, Epinotia meritana ; the 
codling moth, Cydia pomonella ; the oblique banded leafroller, Choristoneura rosaceana ; grape berry moth, 
Lobesia botrans ; currant tortrix, Pandemis cerasana ; spotted tentiform leafminer, Phyllonocytes blancardella ; 
grape leafroller Sparganothis pilleriana : tufted bud apple moth, Platynota idacusalis ; red banded leafroller, 
Argyrotaenea velutinana ; oriental fruit moth, Grapholitha molesta ; Southwestern corn borer, Diatraea gran- 

35 diosetfa; rice leafrollers, Cnaphalocrocis medinalis , M arasmia exigua and Marasmia patnalis ; striped borer, 
Chilo suppressalis ; dark headed stem-borer, Chilo polychrysis ; yellow stem borer, Scirophaga incatulas ; 
white stem borer, Scirophaga innotata ; and pink stem borer, Sesamia inferens . 

The isolated Heliothis nucleic acid fragment is also useful in other regards. The newly observed 
homology between Drosophila and Heliothis sodium channels predicts not only substantial homologies 

40 between Heliothis channels and other lepidopteran species, but also between Heliothis and other non- 
lepidopteran insect channels. Thus, the fragment, or portions thereof, can be utilized in developing RFLP's 
for other lepidopteran species, including, but not limited to, e.g., the lepidopteran species noted above, as 
well as non-lepidopteran species such as as the Colorado potato beetle Leptinotarsa decimlineator , the boll 
weevil, Anthonomus grandis ; the Southern corn rootworm, Diabrotica undecimpunctata ; the Japanese 

45 beetle, Popillia japonica ; plum curculio, Conotrachelus nenuphar ; brown planthopper, Nilaparvata lugens ; 
green leafhopper, Nephotettix virescens ; potato leafhopper, Empoasca abrupta ; cotton aphid, Aphis gos- 
sypii ; green peach aphid, Myzus persicae ; sweetpotato whitefly, Bemisia tabaci ; imported fireant, 
Solenopsis invicta ; thrips, e.g., Thrips palini ; pear psylla, Psylla pyri ; two-spotted spider mite, Tetranychus 
urticae ; carmine mite, Tetranychus cinnabarinus ; citrus rust mite, Phyllocoptruta oleivora ; German cock- 
so roach, Blatella germanica ; cat flea, Ctenocephatides felis ; yellow fever mosquito, Aedes aegypti ; and salt 
marsh mosquito, Aedes sollicitans . The generation of useful RFLPs for these species is achieved in 
substantially the same manner as described herein for Heliothis . 

The Heliothis nucleic acid fragment or portions thereof can also be used as a probe, or can be used as 
the basis for designing degenerate probes, to screen genomic or cDNA libraries derived from such other 

55 non-lepidopteran insect species for specific sodium channels from these species. However, given the herein 
demonstrated high level of homology between the distantly related Drosophila and Heliothis , it is quite likely 
that the present Heliothis virescens fragment can be used directly as a probe for identifying resistant 
sodium channels by RFLPs for other lepidopteran and nonlepidopteran species, without the need for 
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isolation of those species' specific sodium channel DNA fragments. 

Continued monitoring and early detection of the presence of a resistance trait in any population is 
essential to effective insect control. By the time resistance is apparent at the gross level, it is very likely 
already at a point where further treatment with the pesticide is doomed to failure. For example, application 

5 of pyrethroids to a population in which resistance is already established will substantially increase the 
selection pressure favoring the appearance of the resistance trait. Whereas, in the absence of such 
selection, the resistant individuals are reproductively less fit than sensitive (wild-type) individuals. Hence, 
resistance would not otherwise have become established in the population without the application of 
insecticides. Thus, selective and timely application of pesticides or recognition of need for alternative 

w application of pesticides at an early stage can be critical in maintaining suitably sensitive insect populations. 
The identification of a genetic trait associated with resistance provides several avenues for tests to 
monitor the occurrence and frequency of resistance in a population at a very early stage, when frequency 
may be low and/or undetectable by standard bioassays. Early observance permits for informed judgments 
in the application of the relevant pesticide. For example, the gene encoding the resistant sodium channel 

75 provides the basis for informative southern or RFLP analysis of an insect population to identify the presence 
of the resistance trait in a given population. Detection of the unique DNA associated with a resistance allele 
(or the presence of two distinct alleles) therefore is diagnostic for the presence of the resistance trait in an 
analyzed sample. This may be determined, for example, by digesting genomic DNA collected from 
individuals of the target population in question and probing a Southern blot with detectably labelled DNA 

20 sequence that identifies a particular resistance trait, or a diagnostic portion thereof, to identify the presence 
or absence of the resistance allele. By "diagnostic portion" thereof is meant any fragment of the hscpl 
DNA which differs sufficiently in sequence from the corresponding portion of the susceptible DNA 
sequence, or a unique DNA sequence genetically linked to the trait, so as to assure its hybridization, under 
high stringency conditions, only with DNA encoding the resistance trait. It should be noted that sequences 

25 flanking the resistance gene; as well as intervening sequences(introns) are particularly suited for identifying 
unique diagnostic RFLPs. 

RFLP analysis also provides an attractive method of analyzing the existence and frequency of the 
resistance trait in the population. As the Examples below show, there is a detectable polymorphism 
associated with the sodium channel DNA between resistant and susceptible individuals. Thus, target 

30 population DNA can be analyzed for the presence of polymorphisms using the detectably labelled cloned 
hscpl DNA as a probe. In this technique, DNA from several individuals in the target population is digested 
with an appropriate restriction enzyme, and size separated by gel electrophoresis. The gel, or a blot derived 
therefrom, is then probed with labelled DNA. either the whole gene or fragment. If there are both resistant 
and sensitive alleles within an individual in the population, there will appear on the gel two different sized 

35 restriction fragments, each of which will hybridize with the hscpl probe, tn this manner, large numbers of 
individuals in the population can be sampled, and the relative abundance of the allele can be determined. 
Identification of the specific DNA fragment associated with resistance, whether by Southern or RFLP 
analysis, will always be diagnostic. 

In this regard, the present invention also provides a kit for evaluating the extent to which a resistance 

40 gene is present in a given population. The kit will contain as its principle components (1) a restriction 
enzyme for digesting DNA, and (2) a detectably labelled probe comprising a nucleic acid fragment capable 
of hybridizing specifically with DNA encoding the resistance trait, or a nucleic acid fragment capable of 
hybridizing with the diagnostic RFLP marker. In a preferred embodiment, the kit also comprises (3) a means 
for extracting DNA from cells of the target population, and/or (4) PGR primers useful in amplifying the target 

45 DNA sequences. Also included may be a set of reference standards comprising sensitive and resistant 
DNA. 

As a specific example, a kit for the detection of altered sodium channels in a population would include 
(1) a restriction enzyme such as Tagl or EcoRI, which will generate fragments which show the relevant 
polymorphism, if present (2) a radioisotope-or biotin- labelled DNA comprising the sequence of the sodium 

50 channel or fragments thereof; and optionally (3) a DNA extraction means. 

It will be recognized by those skilled in the art that variations or components (1) and (2) in particular are 
contemplated. Any restriction enzyme which produces a detectable polymorphism can be used. Preferably, 
the enzyme used will be a 4-cutter, such as Sau96l, ScrFI, Sau3A1, Rsal, Msel, Mspl, Mbol, Hpall, HinPI, 
Haelll, Dpnll, BstVI, and Bfal; or a 6-cutter, such as EcoRI, BamHI, Hindlll, Pstl, and Sail; less useful are 8- 

55 cutters, such as Notl, Stol, Pad, Sse36l, AscI, Fsel, Pmel, Rsrll, or Swal. The utility of any given restriction 
enzyme can readily be determined by digesting DNA known to contain alleles for both resistance and 
sensitivity with the candidate enzyme, and observing the presence or absence of a polymorphism by 
probing with hscpl, or any DNA linked to this region. Also, it will be understood that the "detectably 
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labelled" DNA may alternately be labelled so as to be detectable in any manner known in the art, e.g., by 
chemiluminescence, bioluminescencse, ELISA, biotinavidin, or any other appropriate means. The foregoing 
scheme is useful for detecting the presence of resistance to not only pyrethroids, but also DDT and 
arthropod toxins, such as the sodium channel toxin derived from Androctonus australis (AalT). 

5 Those skilled in the art will also recognize that the approach to resistant pest management described 

herein is not limited solely to control of resistance based on an altered sodium channel. Utilizing target site 
DNA as a means of tracking the presence of resistance in a population provides a far more precise and 
sensitive measure of the prevalence of resistance than do previously utilized methods. The target sites for 
many types of pesticides are now known, and therefore, the proposed genetic analysis for a resistance trait 

10 can be applied to other insecticides as well. For example, acetylcholinesterase is known to be the target 
site for carbamate and organophosphate insecticides (Oakeshott et aL, PNAS USA 84:3359-3363*, 1987). 
Organophosphate insecticides include malathion, methylparathion, diazinon, turbophos and dicrotophos; 
carbamates include sevin, Aldicarb, methionyl and thiodicarb. Target site resistance to some of these 
insecticides has been reported (Karunaratre et al., Resist. Pest. Manag. Newsletter, 3:11-13, 1991; Chen, 

75 Resist. Pest Manag. Newsletter, 2:15, 1990). The acetylcholinesterase gene has been cloned (Fournier et 
al., J. Mol. Biol. 210:15-22, 1989). providing the basis for development of an analogous detection system for 
this type of resistance . Monooxygenase and mixed function oxidases (MFOs) have also been shown to be 
involved in resistance by increase in the rate of metabolism of organophosphates and carbamates 
(Brattstein et aL, Science, 1961349-1352, 1977; Brattstein et aL, Pesticide Biochem. Physiol., 3:393, 1973, 

20 Krieger et al., science, 172:579, 1971; Matsumara, Toxicology Insecticides, Plenum Press, New York. 1975). 
CyclodieneT have beerT~shown to act at the GABA receptor (Kadous et aL, Pestic. Biochem. Physiol. 
19:157-166, 1983; Tanaka et aL, Pestic. Biochem. Physiol. . 22:117-127. 1984); and target site resistance is 
known to exist (ffrench-Constant et aL. J. Econ. Entomoi. 83:1733- 1737. 1990) and the receptor gene has 
been cloned (french-Constant et aL, PNAS USA, 88:7209-7213, 1991). Similarly, methoprene and certain 

25— botanical extracts - (Precocenes) target the juvenile hormone (JH) receptor and resistance to -these- inseeti- — 
cides has been reported (Wilson et aL, Devel. BioL, 118:190-201, 1986; Georghiou et aL, J. Econ. EntomoL, 
71:544-547, 1978; Dyte, Nature, 238:48-49, 1972). Bacillus thuringiensis (Bt) toxins affect a gut associated 
glycoprotein but resistance has not become widespread. Diacyl hydrazine and certain botanical extracts 
(Penosterone A) target the ecdysone receptor (Wing, Science, 241^:467-469, 1988; Spindler-Barth et aL, 

30 Arch. Ins. Biochem. and Phys., 16:11-18, 1991; Cherbas et aL, PNAS USA, 85:2096-2100, 1988) and the 
genes for the ecdysone receptor have also been cloned (Yao et aL, Cell, 71:63-72, 1992; Koelle et aL, Cell, 
67:59-77, 1991). 

The use of this method is also not limited to detection of insecticide resistance, but may be applied to 
any other pesticides, including herbicides, acaricides, fungicides, nematicides, and molluscicides. A number 

35 of genes conferring resistance to herbicides have been characterized. For example, altered acetohydroxy 
acid synthase genes are the basis of resistance to sulfonylureas and imidazolinone herbicides (EP 
Application No. 91 119 254.0; Yadav et aL, PNAS USA 83:4418-4422, 1986). Glyphosate targets the enzyme 
5-enolpyruvate shikimate-3-phosphoric acid synthase, and mutant genes encoding resistant forms of this 
enzymes have been identified (Comai et aL, J. BioL Chem. , 260:4724-4728,1985). Similarly, genes 

40 conferring resistance to the herbicides phosphothrinicin and bialyphos have also been characterized 
(Thompson et aL, EMBO J, 6:2519-2523, 1987; DasSarma et aL, Science, 232:1242-1244, 1986). 

The target site of various fungicides is also known. For example, phenylamide fungicides, such as 
acylalanines (metalaxyl, furaxyl and bevalaxyl), butrolactones (ofurase, cyprofuran), and oxazolidinones 
(oxadixyl) are known to act on fungal RNA N polymerase (Arp et aL, Fungizider. Mitt. BioL Bundesanst 236- 

45 237, 1981; Davidse, Neth. J. Plant Pathol. 87:11-24, 1981; EPPO Bull 15:403-409, 1985). Resistance to 
these fungicides has also been reported (Davidse et aL, J. Plant Pathol., 87:65-68, 1981; Davidse et aL, 
Experiment. Mycology, 7:344-361, 1983). The fungicide carboxin is known to have as a target site succinate 
dehydrogenase (Schewe et aL, in Modern Selective Fungicides, H. Lyr, ed. V.E.B. Gustan Fischer Vertag. 
Jene, 1987). Resistance and cloning of the resistance gene have also been reported (Keon et aL, Current 

so Genetics, 19:475-481, 1991). The blasticidin fungicides, such as BlaS and Blasticidin S act on the enzyme 
nucleoside aminohydrolase; resistance has been observed and the gene conferring the resistance has been 
cloned (Kamakura et aL, Mol. Gen. Genet. 223:169-179, 1990; Kamakura et aL, Agric. BioL Chem., 51:3165- 
3168, 1987). The benzamidazole fungicides, such as benamyl, carbendazim, mocodazole and thiabenazole, 
act by affecting with microtubule function (Clemons et aL, Pesticide Biochem. Physiol., 1:32-43, 1971; 

55 Hammersdag et aL, Pesticide Biochem. Physiol., 3:42-54, 1973). Resistance is also known to occur to these 
fungicides (Van Tuyl, Med. Fac. Lonbouww Ryksuniv. Gent., 40:691-698, 1975); Meded. Landb. Hogesch. 
Wageningen, 77:1-137, 1977); Fanetran et aL, Mycol. Res., 95:943-951, 1991). The relevant resistance gene 
has been isolated and cloned (Jang et aL, Cell Motility and the Cytoskeleton, 17:87-94, 19906; Orbach et 
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al., Mol. Cell Biol.. 6:2452-2461. 1986). 

Other applications of this method will be apparent to those skilled in the art, in view of the following 
non-limiting examples. 

5 EXAMPLES 

1 . DNA Preparation 

Genomic DNA is prepared from adults of an inbred American Cyanamid Company susceptible strain of 
Heliothis virescens as follows. A moth is placed in 400 ml of grinding buffer (0.1 M Tris-HCI, pH 9.0, 0.1 M 
EDTA, 1% SDS) and homogenized with a pestle. 80 ml of 5M KOAc and 400 ml equilibrated phenol is 
added; the sample is inverted several times and left to stand on ice for five minutes. Two hundred ml of ice 
cold chloroform is added, spun at 15,000 x g for five minutes, and supernatant removed. The procedure is 
repeated at least once. 

Four hundred ul chloroform is added to the pellet, the sample inverted for 30 seconds and then spun 
for 5 minutes at 15,000 x g. The chloroform is removed, the sample spun again for one minute and the 
remaining chloroform removed. Two volumes of cold ethanol are added to the aqueous phase, and the 
sample left to stand five minutes at room temperature. The sample is once again spun for five minutes, the 
supernatant aspirated, and the pellet dried. The dried pellet is resuspended in 50 ul Tris-EDTA (10mM 
TRIS : 1mM EDTA, pH 8.0). 

2. Isolation of Channel Fragment from Genomic DNA 

The isolated genomic DNA is used as a template in PCR with primers based on portions of the 

- 25 Dr os ph iia meianogaster para Hocus sodium channeK— - — - 

Specifically, degenerate primers homologous to portions of an exon in the fourth transmembrane 
domain of the a-subunit of the Drosophila para locus are constructed as follows: 



ra 4991+ 5' <T3) GAAATCACTCCCAATTA ATH GAR AAR TAY TTY GT 3' 
ra 5143- 5' (M13-40) TTTCCCAGTCACGAC ATN GCR AAD ATR AAC AT 3' 



35 where H = A, C or T, R = A, G or T, Y = C or T, and N = any base. Numbers refer to 3' terminal base 
positions in the para sequence. Underlined sequences are universal primer tails T3 and M13 -40 
respectively used for sequencing of product. 

PCR reactions of 100 ul are constructed of approximately 1 mg of genomic DNA, 1 mg of each primer, 
0.2 mM of each dNTP, 10 mM Tris pH 8.3, 50 mM KCI, 2mM MgCI 2 , 0.001% gelatin, and 2 U of Tag 

40 polymerase. Reactions are incubated for 5 cycles, each of 50 seconds at 94°C, 2 minutes at an annealing 
temperature of 53° C, and 25 seconds at 72°C, then for 35 cycles with an annealing temperature of 45°C. An 
amplification product of 184 base pairs is obtained, and then directly sequenced using the Sequenase kit 
(United States Biochemical Co.) according to the manufacturers directions. The deduced amino acid 
sequence is found to be the same as for an equivalent region in para . 

45 Genomic DNA is also digested with several restriction enzymes, specifically EcoRI, BamHI, Sail, Hindlll, 
Pstl, and Xbal. The fragments are separated on agarose gel and transferred to a nylon support. The PCR 
product described above is radiolabeled and hybridized to the nylon blot at 60°C overnight. The blot is 
washed with a wash buffer (IMNaPi, 250 mM EDTA, pH8, 20% SDS; Napi = Na 2 HP0 7H 2 0, 134g and 
H3PO4 to pH7.2/liter) at 60°C three times for thirty minutes each. The filter is exposed to film. The film is 

50 developed after 12-24 hours of exposure at -80°C. The results show single bands in each lane indicative of 
a single copy gene. The largest band is for the EcoRI digest. 

Based on the foregoing information genomic DNA is prepared from an ICI America's pyrethroid 
resistant PEG-87 H. virescens strain using cesium chloride purification as described by Ausubel et al. 
(Current Protocols in Molecular Biology, Green Publ. Assn. and Wiley Interscience, 1989), and digested to 

55 completion with EcoRI. This DNA is used to construct a genomic library in the Lambda-Zapll vector 
(Stratagene Co., LaJolla, CA) following manufacturers 1 instructions. The 184 bp PCR fragment is used to 
screen this library by hybridization as described in standard Lambda-Zap II protocols. Several positive 
plaques are purified and the inserts excised in vitro following manufactuer's instructions, and subsequently 
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characterized. A genomic clone designated "hscpl" has approximately 8000 bp, and is extensively 
sequenced. For this first 990 base pairs of coding sequence, there is significant homology between hscpl 
and the published para sequence of Drosophila (Loughney et aL, Cell , 58:1143-1154, 1989). 

3. RFLP Analysis 

Fragments of the gene from individuals of both ICI- pyrethroid-resistant lines and American Cyanamid 
Company susceptible strains (collected Stoneville, Mississippi, 1963) are amplified by PCR using several 
pairs of primers based on the available hscpl sequence. In this specific example, hscp4116 + and 
hscp4399- are used. The PCR reactions, of 100 ml, consist of 100 ng-1mg of genomic DNA, 100 ng each of 
primer (hscp 4116 + , 4399-, as shown in Figure 1) and other components as described above. Negative and 
positive control reactions are also made respectively, without template DNA or with hscpl DNA. 

Reactions are incubated for 30 cycles, each of 50 seconds at 94°C, 2 minutes at an annealing 
temperature of 56°C, and 1.5 minutes at 72°C. PCR products are purified with phenol, chloroform and 
precipitated using ammonium acetate-ETOH. PCR products are then apportioned among three different 
restriction enzyme reactions mixes following manufacturers' instructions (Rsal, Sau3AI ; and Msel, New 
England Biolabs, Beverly MA), and incubated at 37°C overnight. Digestion products are resolved on a 3% 
"NuSieve" (FMC) agarose gel and stained with ethidium bromide at about 50ng/ml. The resulting restriction 
fragments length polymorphisms show a distinct pattern for each of the resistant and susceptible strains 
(Fig. 2) ; indicating the utility of this method in detecting the presence of resistant individuals among a 
generally susceptible population. 

DEPOSIT OF BIOLOGICAL MATERIALS 

The - fottowing -materials have been deposited with the- American Type Culture Collection, -1-2301- 

Parklawn Drive, Rockville, MD, on October 19, 1992 and have been given the following accession numbers. 



Deposit 


Accession No. 


Sodium channel para homolog (3' half of gene) from 
Heliothis virescens ICI strain PEG-87 (hscpl) 


ATCC 75334 
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FIGURE 1 



Heihthis and Drosophila sodium channels. start/end of my sequences, _ gap, * same as above. 3/12/92 pi 

D.mel. 

para ATGACAGAAGATTCCXX^CTCXIATATCT 

! . ♦ * + 75 

MTEDSDSISEEERSLFP. PFTRESLV 

On CAAATO^CAACGCATTGCCGCTt^CATCAAAA 

76 ♦ ♦ * * ♦ 150 

01 EQRIAAEHEKQKELERKRAEGEV 

Dm CCGCGATATGGTCGCAAGAAAAAACAAAAAGAAATCCGATATGAT^ 

151 — * *- — - * ♦ ♦ ♦ 225 

PRYGRKKKQK2IRYDDEDEDECFQ? 
Aincron A /\B 



GATCCTACACTTGAACAGGGTGTGCCAATACCTGTTCGATTGCAGjGCAG ATTGGCCTCTACT 

226 * — * • * * 

DPTLEOGVFIFVRLQGSFPPELAST 



Dr. CCTCTCGAGGATATCGATXTCCTACTACAGCA^ 

301 ♦ *- - * * * 375 

FLEDIDPYYSNVLTFVVVSKGKDIF 

/\C 



CGCTTT?CTCCATCAAAAGCAATGTGC^Tt>C^ 

376 * ♦ * + 

RFSASKAMWMLDPrNPI R R V A I Y T ^ 



Dm GTCCATCCATTATTTTCCCTATTCATCATCACCACA^ 

45l * ♦ * - ♦ 525 

VH PLFSL FI ITTI I V fJ C I L tt I M P T T 

I -SI 

Dm CCCACGGTTGAGTCCACTGAGGTGATATTCACCGGAAT^ 

526 ♦ ♦ * ♦ ♦ * * ♦ €00 

P T V E S TEVT FTC I YTFFSAV KVMA R 
I-S2 

Dm GGTTTCATTrTATGCCCGTTTACGTATCTTA 

601 --- * * ♦ ♦ ♦ 675 

GFILCPFTYLRD AWWWLDfVVIALA 

I-S3 AD 

Dm TATGTCACCATGCGTATAGATTTAGGTAATCTAGCAGCCCTGC^ 

6 76 + - * - — * — ♦ ♦ ♦ * 750 

V V T m r. T D L G N L A ALPTFRVL R A L K T 

I-S4 

SCp 788* AArACnAThGTnGGnGC • > 

Dm ' GTACCCATTGTGCCAGGCTIXSAAGACCATCGTCGGCGCCGTC 

751 - ♦ ♦ ♦ ♦ * 825 

VAIVPGL K TIVGAVIESVKNLRD )L I 

AE I-SS 

Dm ATCCTCACCATCI'IXLTCICT^^ 

826 ♦ ♦ * ♦ + ♦ 900 

TT. TMPfiLSVFALMCLOTYMGVLT E K 

Dm TGGATCAAGAAGTTCCCGCTGGACGGTTL'L* 1 GGGGCAATCTGACCGACGAGAACTGGGACTATCACAATCGCAAT 

901 * ♦ ♦ ♦ ♦ ♦ 975 

CIKKFPLDGSWCNLTDENWDYHNRN 

Dm AGCTCCAATTGGTATTCCGAGGACGAGGGCATCT 

976 ♦ + ♦ ♦ + * + 1050 

SSNWYSEDEGISFPLCGNISGAGQC 

/\F 
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Heiiothis and Drmophila sodium channels. *"* start/end of my sequences, _ gap, ~ same as above. 3/12/92 p2 

D& K ♦ AAy C Z nAAyT Ay GG r. T Ay AC - > 

S F D S F G 

Doyle and Knipple's sequence AGTTTCOATTCATTCGGT 
Dm GACGACGATTACGTGTGCCTGCAGCOG7TT ^TCCGAATCCGAATTATGCCTACACCAGCTTCGATTCGTTCGGA 

1051 ♦ * 1125 

D D D Y V C L 2 G r 1- P N ? N V G Y T 3 F D S F G 



SCp 1153- < - TA 27 3~ GTyCTr AAr ACC 

D&K- < -TA27GrX7iyC7rAArACCCTy 

W A F L S A F ?. - 

D&K TGGCXTITTCXriXriOSGCGTTTCGTCTC 

. Dm TC<Xr<TT7TCGTCTCGtXGT7GCG2- GTCAT 2A t*CC-A:T7~ITXX>AG-~ATCTGTACCACX^^ 

1126 * ♦ - - — 

75 W A F L S A r r. 1, K ? C S F W E D L Y Q I. V L R A 

Dm GCCGGACCATGGCACATG'CTGTTCTTTAT.-. 77 C.ATCA7CTTCCTAGGTTCATTCT ATCTOTOAATTTG ATTTT 

A G ? w h w l f ~ : " : r f t_ c g f v l v ?j :. : :. 



Dr GCCATTG77GCCATGTCG7A7GA7 GA_A7T7 ~ ^.AAGGAA.CA>GCGAAGAAGAAaAC^CTGCCGAAGAGGAG-GCGATA 

127 6 * ♦ - * 135; 

A : '■• A M S ; 2 I L. ; = Y A E E E E A A E E E A : 

Dr. CGTGAAGCGG AAGAAGCTG GGGCC 2 2 G AA.A G 2 GGCCAAGCTOGAGGAGCGGGCCAATGCG^ GAGGA 

i3Si - *- ♦ 

R E A E E A A A A Y. A A K 1 E E P A N A Q A Q A A 

Dm GCGCATGCGGCTGCCGCCGAAGAGG GTGGA GTGCATC'CGGAAATGGCCAAGAGTCCGAC^TATTCTTGC^TCAGC 

1426 - ♦ + ♦ 1500 

ADAAAAEEAALHPEMAKSPTYSC IS 

Dm TATGAGCTA'l "1 TO" I "! , GGCGGCGAGA_-.GGG GAA 2GATC ACAAGAACAAAGAGAAG ATGTCCATTCGGAGCGTCG AG 

1501 ♦ - - * ♦ ♦ 1575 

yelfvcge:<g::ddnnkekmsi rsve 

Dm GTGGAGTCXX^GTCXXnXlAGCGTTA.TAC^ 

1576 ♦ - ♦ — ♦ * * 1650 

VESESVSV:C?C?AP7TAHQATKVR 



Dm AAAGTGAGCACX7TACACGATACGGA_ACGGA.2G7GGCCGCTTTG 

1651 - — - * 1725- 

35 kvstytir::g-crfg:pgsdrkplv 

/\alc. exon A ^3 bp 

Dm TTGTCAACATATCAGGATGCCCAGCAOCA2TTG 

1726 ♦ ------ * ♦ ♦ -♦ 180C 

lstyqdaqch: pyaddsnavtpmse 



50 



GAGAATG<XXXXATCATAGTCCCCOTGTACTATGCCAATCTAG 

1801 * - ♦ X875 

ENGAI IVPVY V GNLGSRHSSYTSKQ 

Dm 'iMCCGAATATCGTATACCTGAGATG: Ga--2TA.2TCGGTO0CATGGCCGTCATGGGa7IX^ 

1876 ♦ * ♦ * * 195C 

SRI SYTSHGDL.LGGMAVMGVSTMTK 

Dm GAGAGCAAATTGCGCAACTGCAACAGAGGGAATGAATCAGTGGGC^ 

1951 — ♦ — — * — * * 2025 

ESKLRNRNTR ::QSVGATNGGTTCLD 

Dm ACXZAATCACAAGCTCGATCATCXX^ 

2026 ♦ + * ♦ ♦ * * + 2100 

TNHKLDHRDYEIGLECTDEAGKIKH 

Dm CATCACAATCCTTTTATCGAGCCCGGCCAGA^^ 

2101 -- - * * — - * 2-75 

HDNPFIEPVQTQTVVDMKDVMVLND 



55 
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Figure 1 



5 Heiiothis and Drosophila sodium channels. start/end of my sequences, _ gap, " same as above. 3/12/92 p3. 

On ATCATCGAACAGGCCGCTGGTCGGCACA^ 

2-76 ♦ ♦ * * ♦ --- * 225C 

1 IEQAAGRHS?. ASCRGVSVYYFPTE 

' \H <-- alt exon B / \ I 

Dr, GACCATGACGAC^ATGGGCCGACCrrrCAAAGA^AAGGCACTCC^ 
10 2251 ♦ -- - - -* - 2325 

2 D D E v G ? T F K C K A L E V I L K G I D V F C 

an GTGTGGGACTLiTTGCTCrGCi' : ' 1 * 1 lid' H XjAAATTTCAGGAGTGGGTAT Z C-ZTCATCir 1 *L'l ' ! 'CGATCCCriX.'Lt' I *CGAG 

2326 * * 2400 

v w D C C w V W L K F Q £ w V s : . T V r n P g y v 

::-Sl 

75 

^r. CTCTTCATCACGKTIXriXjCATTGTGGTCA^ 

L F T T L C T V V N * M r * A V T HHDMNKEK 

Ctt. GAACGCGTGCTCAAGAGTGX3CAACTA I ' l ' iv) ' . --.CCGCCAC^.V: L- C2ATCGAGGC CA rCATGAAGCTAATGCCC 

2476 * - - 25SC 

20 z r. v l y g c n v f f t a - - ^ - e - y y. l y a 

::-s2 

lY 1 " ATGAGCCC'ZAAGTA\_"I A I L '"i"CCAGGAGGGs-". GGAACA7\_v. GA\_"I j . 2ATTATCGT GGCC V I ATCGOI A ! * : C-GAA. 

2 - . . „ . . . 25 2 i 

y S P Y. Y Y F Q £ G W N ; » " F Z Z V A L £ L L E 

II-S3 

-25" - . — - - ...... 

Zkt CTGGGACTCGAGGGTGTCX2AGGGTCTG?CCGTATTGCCrrT^ 

2626 ♦ 27CC 

? G T. F C V Q G L S V I. P S e P • 1 R V F K L A X 

II-S4 /\J 

DtP TCTTGGCCGACACTTAATTTACTO\TTTC 

30 2701 - — ♦ 2775 

S W P T L N LLISIMCRTMGALC N L T F V 

II-SS 

DtP CTTTOCATTATCA'IVrftJ VUVl T CGGAAACAATTATCATGATCACAAGCAC 

2776 *• * - * ♦ * 2850 

L C I I I F I FAVMGMOLFG K N Y K D H K D 

35 '™ 

Drr CCCTTTCCGGATG<^CACCTGCCCCO<rrra 

2851 ♦ ♦ -* — * ♦ 2925 

RFPDGDLPRWKrTDFKHSFMIVFRV 

Dn CTCTGCGGAGAATGGATa^GTO^TCrrCK^ 

40 2926 * * ♦ -* * * ♦ 3000 • 

L C G E W I ESMWDCKYVG3 VSCTPFFT, 

II -S6 

Dm GCCACCGTTCTCATOGCCAA lVnUH^ A 

3001 ♦ ♦ ♦ — * 3075 

ATVVTGNLVVLNLFLALLLS N F G S S 



45 



50 



Dm AGCITATCACCCCa^CTGCtGATAACCA™ 

3076 ♦ * ♦ ♦ ♦ * ♦ 3150 

SLSAPTADNDTNKIAEAFNRIGRFK 

Dm AGTTtX^GTT AAGCtH'AATATTGCTGA'l ' IX/ 1*1*1 W^GTTAATACGTAACAAATTXIAC AAATCAAAT AAGTGATCAA 

3151 * <• ♦ ♦ ♦ * 3225 

SWVKRNIADCFKLIRNKLTNQISDQ 
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FIGURE 1 



Hetiothis and Drosophiia sodium channels. ••• start/end of my sequences, _ gap. " same as above. 3/12/92 

Dm CCATCAGAG^TGtTTGACAACGAACTGGAGCTCGGCCACGAC 

3226 *- --- * --* * 3300 

PSEHGDNELELGHCEILADGLIKKG 
/\ alt. exon E 39 bp 

Ir. ATCAAGGAGCAGACCCAACTGOAC^rTG^ 

33C1 * ♦ 337S 

I KEQTCLEVA ICDGVEFTIHGPMKN 

?rr. AACJ^GCCGAAGAAATCCAAATATCTAAATAACGCAACC 

3376 ♦ ♦ * ♦ :r,rrm L 

NKPKKSKYLNNAT 

START HSCP1 CLONE 

scd£l pS-S ££fi£I* * ' AAlTCACTATa CCAGGTAAL'I" I V i ' IvjATACCTA 

SCd6I GTTTAAAA7AAGATA CI V» ' I T ATCTAATAG^^TTTTAAGAGTTXTTCATAAACtTrAATGTT AA' J * 1 * I ' l * 1 CAGGCG 

3Cd£ I ACAATAAATACAAGAAAGGgcAAAA iV!"!\, 3 ' 1 AAATAAT ATT AACGCA wt aACAGAT AATCAT AGAGACAACCGT 

3cd£ . TTACACTCTCAATTAAATCATCACG0OT ATCCTATACAGGT A/- A?A' "'."" J /JGTCACAOCTT kCTAATAAATCAC 



. . . CATAA - > no in~ron 

_HKI rs_fk_de_skkg_sadt: DG? ? ? K D„ 

SCC61 " CATAAAA t CAGGTCGTTCAAAGATC-* AAGTCa T Aa AGGTTCC GC Ag AC ACGAT AGATGgCGa mgmG rmGAAGG AC 
Dn CATAAGAATCtACCATTCAAGGACGAC^GCCACAAGGGCAGCGC 

--♦ * * * * 3S2S 

HKNRPFKDESHKGSAETMEGEEKRD 
Aintron M 

ASKEE LGLEEE. . 
scd6 1 GCTaGTAAAGAGGAATTGGGTTTAGAAGAAGCJXCfiGTGT^ 
Dn GCCAGCAAGGAGGATTTAGGTCTCGACGAGG 

3526 ♦ ♦ * no incrcr. 

ASKEDLGLDEE-. 

scd61 CCATATTTGGA 

--MVEEEEDGKLDGG LG K 
S cd6 1 CA&TTTGCATATAATTAATCTCTTACA£AAATCCT^ 

Dm AACTG<^CGA.GGAGG<jCCAATGCGAGGAGGGCCCGCTCGACGGT 

♦ * - * 360C 

..LDEEGECEEGPLDG 

TDI IVAADEEVVDD3PADCCPEPCY 
5 -ef 1 ACAGaCATTATACTGGccCCACatGAACAACTTCTT^ 

Dn GATATCATTAT*TCATGCACACt^CGAGGATAT ACTCC ATGA.A T ATCCAGCTGAT PGC rCCCCCGAT'I'CGl ACTAT 

3601 ♦ -♦- «• * * 3675 

DI I IHAHDEDI LCEVPADCCPDSYY 

AKFPFLVGDDESPF WQGWGMLRLKT 
scd61 G<XAAGtTTCCATTCCTIXrrGGGTt^ 

Dm AAGAAATTTCCGATCTTAGCCtXTrcACGATGACTCDCCGTTC^ 

3676 * ♦ * -* 37S0 

KKFPILAGDDDSPFWQGWGNLRLKT 

FKLIENTYFETAVITMILLSSLAL 
scd6 1 TTCAAACTCATTGAGAACACATATTTcGAAACCGCTGTG^ 
Dm TTTC^TTAATTGAGGATAAATATTTTCAAACA 

3751 ♦ ♦ ♦ * «• * ♦-- no incron 

F R L I F.PKVFETAVTTMI L M S S L A L 

III-S1 



12 



EP 0 615 976 A1 



FIGURE 1 



70 



75 



45 



Heiiothis and Dmophiia sodium channels. "* start /end o* my sequences, _ gap, " same as above. 3/12/92 p5. 
scd61 ^GTTCTCAAATAA 

A 

scd6 i TTTTCTC^CAcTTTCTTTCA^TACT^^^ 

Dm GCA 

r.o intron 392$ 



LEDVNLPHRF 1 L C. D I LYYMDRI F T V 
scdfi 1 TrAGAAGATCTAAATTTACCACATCGACCGATrCTTcAAGATAT C ' ri ' 0 1 ' A TTATATGGATCGGATCTTCACCGTC 
Dtn TTAGAAGATGTACATCTGCCACAAAGACCCATACTGCAG^^ 

3326 + - * * * * ~? 

ledvhlp^rp:l;d tlyvmpp.ic-tv 

III-S2 

hscp-3868- <-CCACGACC. . . 

SCp 3975* "4229W39S5 + * ACnAAv-GCr.TGGTGyTGG - > 

IFF " E K L I K W L A 1 * F P K Y £ T N A W C W 

3 cat I ATTTTCTTCATCGAGATGTTGATCAAATGG rTTGCCCTTGCCTTCCAGAAATACTTCACAAATtX^ 

Ztr. AT ATTCTT^TTGGAAATGTT AA7CAAG7 Z 77TGG - G 77 C 3G CTT CAArtCTGTACTTG ACCAACCCGTOGTGTTGC 

IFFLEWLIKWlALGr K V Y L T N A W C W 

II ■ -S3 



is cp 3 5 5 S - GAC-CTGAAGTAGTA 



s cd 5 1 CTCGAC7TCA7CAT7G7CATG ^*=A7A77. 1 .77A7AAA7 A 7 A7T7G 7 1 777G7ATCA7TTGAAC7 AACA GTTTC 77 
Dn CTCGATTTCCTXATTCTCATG 



25 ■ " _ - . - - — 

scd6 1 TGCAG ATT AGATTGGTAAAACG7 AGATT A"T g ATT AT c C- AA TTTXl^CTTtTTAAGTTCTGT ATAAT<7TGAAAG AC A 

scd6 1 AAATTAAgGTTCAGGTCGCTC\ ' : " ! XJAAGT77ATCCT7 c 70 cCTCTCAGCGAGGTAAAGCTGGGAAGAATAATTTA 

scd6 1 TACAGTGTTAAGTATACCTAGA7GTAAGGAA7ATATTG7A7A:^AAAGTAAA 

scd61 GTCGCTcCrTAAACCACgGnGO.GTGATgsTGGcg sGACg ACATCCCnCTTCcGCTCGAt GcACgTTGngng cGCT 

scd6 1 CCOGcfTCCGc<XXSGrCTCTCGCTg(^^ 

SCd6 1 TXXTrATCGTTGCGCTGCACATCCACACGA' i ' 1 VrWCACTCTAC I TI VIXXnTSAGAAATCAGTGCAACATCCTGTT 

scd61 GCT AATCGAAATAAGCAACCAAACL' 1'IXA. L-ACAGAGA , Y , 1 : \ ATCIXTGAacCACt TTGTGAAATGTGAACTCTGA 

30 scd6l TTCATATTCAACTAA 1 V1X.MTAATAAA Xj I ' / :XJU * I VI ' AAATA' i \ ' l " ICl ' A ATTCTACT<n Vl * ritj AOGTGCAGCGCA 

scd61 ACTCAAACXTXTIXX^GCTTTGATTCTTCGA^ 

scd6 1 AgcTCTCGATGTCGTA'i'IV/i"; ^ /1TATC-7 AAAcCGCTTCA Z GTGACTCTATAACCCACGACCcccGCTATATGA 

scd6 1 ATACCTCTGrCarTATATATaAA^ 

scd61 AC tT IVT i CTAT lTiV^ I *G C 'I Xj '7 7 G t GcCAGAT AGTG O.- . . * . ^ CAGGTAGTACTCACATTACCCACATCTGTCGGG 

SCd6l GATAAAATCCGA*ATTTCAAAGAAAAGCT77W 

SCd6 1 ACAAACX^AAACTAT AAATACCTAT ACTAT A7ACAT AT7TTTGAT A ,\WI"!VI T AAT ATGATGTGATGTAGCt TT 

SCd6 1 ATTTTAgCGACATCAGAGAAACOGTAGCCTAAGCTCAAAATTAGAG ^ ' I * 1 * I V1XJ1 A AAATCAATCwTiJ H AATTGC 

35 scd6l TATATAATT A TI * IVLATI ' lt- * I ' i ' 1 ' ! ATTCTCTGATC VyCy/TTiAAr kwAmyTCX^TGTAACCTTAT<rrGT'AACTTGA 

SCd6 1 GTGAATATCAC\rrTCX7TATCCCTCI^TTATGC 

scd61 TTCTTTATAGTATCATATCCTTAGGTTTGTA* * * 

GAP IN HSCP SEQUENCE 

40 7 7ATCGC7"T A7CAA 777 rtTTGCTTCACTTGTTGGAGCT^ 

intron N * + + ♦ 4050 

V s i : N r ft s r. Y C A G G I 0 A 

III-S4 



Dm TTCAAGACTATG<XSAACGT^AAGAGCACTC^AGACCACT.-.^GT 

4051 ♦ * * no intron 

FKTMRTLQAL= 2 \ = 



HSC 4116+ "Fred" TG AGCCGCATGCAGGGCATG - > 
HSC 4105- "Jenny' < - ACGTCCCGT ACTCCCATGCA 

AMSRMQGMR intron? 

scd7 2 * * * ATTAOTXriTCAAAAGa^TGCGAAGCTGG^ 

Dm CCCATGTCCCGTATGCAGGCCATGAGG 
50 no intron * ♦ intron O 

: A M S R M O G M R 
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FIGURE 1 



Hetiothis and Drmophila sodium channels. "** start/end of my sequences, _ gap. * same as above. 3/12/92 p6. 

s cd7 2 ACCACCCTGTCCTGCCGACAACACCCTa t cgCTCATCCATCCACCACACACTTCGCTCCACAC^^ 

srd72 TTCTATTTCAAL T XTACGATCA r^ 

s cd7 2 C CCATA rnV I XX ^TsAATCACCGGATCAAAA r A 0 . ' I ' l ' I ' l AATACTTAATTTGCACAGTTATCrCATTCATTC^ 

SCd7 2 AGT AGTCGATTGAAGT AATTATT AGTG AATCA ! ' , T , \J kACTGGTCGOTGGCAC C 3 CTGAATGCCTTAGTATCATCA 

scd72 CTGTTCGTCATAAACCTCTTTTACAAACK^^ 

SCd72 TTT Z CT ATTGGTCTTATT ATT AGCT AGATTAGAL' I ' ^ \ * ! X7TAATT ACTT AGTTATTTGGAATGCT AATTT ATATTCT 

scd7 2 GCACCTTAGATTTTTTCTTCTTGTATCTTCATCC.-. • ♦ • 



GAP IN HSCP SEQUENCE 



scd!31 



• * * GCTAACTCCTACATAGTTACTCCACAGTATTAATGACA 



TTAAOTrCCTTATATCCCAJ^AATA^ 



?20f4/li 



ZZAATATATCr 



412S 



V 



HSC 4211* 



scdl31 
P20m4/ll 
P20C4/11 
Dm 

4126 



CTGATCTTC_. 
' I F W L I F 
TATCTTCTCGCTGATCTTC 



"V "N A L V *Q A I ? S I F N V L L 
GTAAACGCTCTCGTGCAAGCGATCCCCTCCAT 

A 

A 

CTTAATGCGCTGCTACAAGCTATACCGTCCATCTTrAATCT 



N A 



V , Q 



V I L 



T F W I, T Z 



III-S5 



4211 
HSC 423 
Rr.O 8- 
SSO 8* 

scdl 3 I 
P20m4 / 1 : 
P20f4/i: 
Plm24/9 
Dm 

420] 



. GCCATCATGGG - > 

5. -421S*" ACAACTCTTCGCTGGinAAATA - > 

CAAA7A7TTCAAGGT 



AAAATATTTCAAGG7AAGCAG - > 
AlMCVQLFAGKVrK 
GCCATCATC^GAGTACAACTtTITCGCTCGC^^ T 



GCCATAATGCGTGTACAGCTTTTTGCTCGAAAA7 



TAAG 



a V O L F A G 



rAATTTATTAACATAACAAAAA 



•AGCACTA" B GT" 
•AGCAGTA" -GT* 



HSCO 5 2 - < - T AGAAT AATCA . . 

scdl 3 1 AATATTTCAATTCGTAAAATCTTATTAGT 
Plm24/9 r "' 



. . .CACAAGTTTTA 
sedl31 (^M^/ITCAAA A ' lTlXj rAACAT O , riM 'T , I L , rri , U i\ T. 
Plm24/9 C T C*T* 
Cm 



CVDLNHTTLSH 
r^A^TGCGTCGACCTCAACCACACGACGTrGAGCCAC 



TGCGAGGACATGAATGGCACGAAGCTCAGCCAC 

=n P ♦ ♦ ♦ 4275 

CEDMNGTKLSH 



HSCP4 343* TGGGAGAACTCACCGATGAACTT - > 

HSC 4325* 14335*) ATCTTAG ACAACTACACCTGGGA - > 

EI I PDRNACILENYTWENSPMNFDH 
scdl 3 1 GAAATCATTCCAGACX33CAATGCCTGCATOT 
Plm24/9 G A 

On GAGATCATACCAAATtTGCAATGCCTGCGAGAGCGAG 

4276 * -* ♦ * * -* «- * 43S0 

EI I PNRNACESENYTWVNSAMNFDH 
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Hetiothis and Dwsophila sodium channels. "* start/end of my sequences, _ gap, " same as above. 3/12/92 p7. 

HSC 4394- *Heloise" < -TCCCTACCTATG*PCTACTAC 

HSC 4415* *466S*' AGGGATGGATACACATCATGAA- > 

hsc 43 99- "liz" < - acctatgtctagtacttgctgcg 

vgkaylclfqvatfkgwiqimnda: 
s cdl 3 1 gtcggcaaggcgtatctctgc t, ' 1 u ri ^caagtggccaccttcaagggatggatacagatcatgaacgacgctatt 

Flm24/9 

Dm CT AGGTAACOCCT ATCTGTGC- 1 ■ i 1 IMCCAACTGGCCACCTTCAAAGGCTGGAT ACAAAT^ 

43S1 - * - ♦ - 4425 

V G M A V L C L F 0 V A T F K G W I Q I M N D A I 

D S R E 

scdl 3 1 GATTCGAGAGAA GTATC GCTACTA'IT 1 Vl'l" 1* I 'CL' 1 Tl' 1 V 1' ICATAAGTTCATAAATTAATATCAATAAAAATAT'C 
On GATTCACGAGAG 

440* ir.trcr. ? 

75 D S R E 

s cdl 3 1 ACGCAATACAATAAA7GATAT 

V G F Q F I RETNIYKYLYFVFF: 
s Cdl 3 1 TC7TAATGCCA£GTCCOCCGGCAACCTATACG^ 
2C Drr CTTXXl^CAAGCAACCAATTCGT GAAACGAACA7C7 ACATGT A 7TT AT ATTTXTGT ATTCTTCA7C 

ir.tron C * * * * ♦ 4500 

v r k ; ? : = i t n : v v •= : - v r ■ ? f : 

:::-s6 

IFGSFFTLNLFIGVIIDKrKEQKKK 
s cd 1 3 1 AT ATTTGGCTCATTCTTCACTCTCAACCTATTCATC^ 

25 Dm - " ~ ATATTTGGATCATTTTTCACACTCAATCTGT^ 

4501 — + * — * * * * 4575 

TFGSFFTI. NLFTCVT I D NFNEQKKK 

A GSLEMFMTEDQKKYY H A M K K M G S 

scdl 3 1 GCT__GCCAGCcTrCAGATGTTCATGACTGAGCAC^ 
30 Dm GCAGGTGCATCATTAGAAATCTTCATG^ 

4576 ♦ * ♦ ♦ ♦ «- * + 4650 

AGGSLEMFM TEDQKKYY5AMKK M G S 

PKC acciv'r. site West ez al Science 254. 866 

KKPLKAI PRPK"' 
scdl3 1 A^AAAACCTTTAAAAGCTATCCCCACACCCAAC^ 
35 Dn AAAAAACCATTAAAAGCCATTCCAAGACCAAGG 

4651 ♦ ♦ intron R 

KKPLKAIPRPR 

scdl 31 TATCCAAGGAGGAGCGAGTCTTATATITGAAACTrGATAGTAATATT^ 

scdl 3 1 CAGTACTGCGGTAAACT^TTGTTTTCAACGCCAGAAA GCCTA 
scdl 3 1 G AATCTATTCTAAGATTGATrTGGAGCCGTCCAL''l ' 1 l.C CA AC^ACAGTTGCAGCATCT ATCOCACCGGACCACGT 
40 scdl 3 1 CGTTGTACCCAGATAAGAAAGCTTTCTACC 

WRPQAIVFEIVTDK 
TAAATAAACACTAACTGAAACTGTTT^rTTCCAjSTGGCGG 
Dm TGGCGACCACAAGCAATAGTCTITGAAATAGTAACCGATAAG 

intron R ♦ * ♦ ♦ 4725 

WRPQAIVFE T V T O K 

IV -SI 

KFDMI IMLFIGLNMLTMTLDHYQQS 
scdl3 1 AAGTTCGACATCATCATCATGTTGTTCATCC^ 
Dm AAATTCGATATAATCATTATGTTATTCATTCG^ 

4726 * * ♦ ♦ --♦ * * * 4800 

KFDIIIMLFIGLNKFTMTLD R Y D A S 
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10 



Heikythis and Dmophila sodium channels. "* start/end of my sequences, _ gap, " same as above. 3/12/92 p8. 

HSC 4834- (8/10/90) < - TACT AT AACT AGCACT AT AAGTC 

ETFSTVLDYLNMIFIVIFSSECLLK 
scdl3 1 GAGACCTTCAGCACTGTCCTCGACTACCTCAACATG^ 
Dm GACACGTATAACOCGGTCCTAGACrrATCTCAATC 

48C1 * * * 4875 

D T Y N A V L " Y *; A I r V V " r S 5 F C L 1 Y. 

IV -S2 



75 



MFALRYHYFVEPWNLFDFVVVNFSI 
scdl3 1 A'lX mUAX TlACOCTACCATTA ClTlVr^ 
Dcr. A7ATVCGC7TTACGATATCACTATTTTA7TGA 

4876 + ♦ * ♦ * * 495C 

I F A L R Y H Y F I EPWNLFDVVVVI L S : 

IV 23 



L S. . 

s cdl 3 1 CTTA222o<n , ATTTTGGG7 
Or? TTAG 

4951 intron S 



TATTCCAATAGTAAA J iV/l " / 1 CCATTTATAAT*!TACTAATGATACACTC 



20 



~25~ 



SCpu 4991* (5246~j 



T3^TnGAxAArTA>-TT7~ - 

T L ' I 'i' : I . ' 1 1 ' I ' 1 "OCT ATTt^GTGAT ATTATAGAAAAAT AT"~~ 

Dr. G'l V:\CTACTTAGCGATATTATCGAGAAGTA~ 

incror: S * * * 

. LVLSDI I F._ K Y F V__S P 



3 r T L L F. V V R V A 
"7CACCC ACGTT ACTOA GGGTGGTGAGAGTAOCC 
"CGCCGAC CCTGCTCCG AGT^GGTGCGTG TGG CG 



L R V V P. V 
IV-S4 



30 



HSC SC97* (53 S0*» TTGTTCGMGCTGGCCAT - > 

HSC 5083- < - AAGCGCGACCGGTACAGTGA 

HSC 5095- <-TACACT. . . 

KVGRVLRLVKGAKGI RTLLFGLAMS 
scdl 3 1 AAG<rrCGGTCCTGTGTTOCGTCTCGTCAAGgG^ 
Dm AAACT<XX X tJ O "I t/ 1 U/ ITC l^ ^^ 

5026 ♦ ♦ ♦ ♦ ♦ * ♦ + 5100 

* V « PV*-3 f. V K G A V T- T B_T LLFALAMS 



35 



HSC 5 G 9 5 - . GACGGTCGGAATAA 

SCpu 5169* (5426+) 

SCpu 5143- (5430-/5218-) 

LPALFNICLLLFL 
s cdl 3 1 CTGCCAGCCTTATTCAACATCTCTCTGCr^^ 



T3 *- GCnAThTTyGGnATG - > 
< - TACAArTAdAArCGnT A& . Ml 3 . - 4 0 
VMFIFAIFGMSF 
GT^^TGTTCAT^ITKrGCCATCTTCGGCATGTCGTTC 



Dm CTGCCGGClXrl VIM 1AAGM VI t^CcTuCTOul \jt J !\_vi GGTCA i vl" I\ATur n -uu^A v rri vuGCATGTCGTTC 

5101 ♦ — - ♦ - ♦ — --- 5175 

L P A L F N ICLLLFLV MFIFAI FGMSF 
IV-S5 



40 



FMHVKDKGCLDDVYNFKTFVQSMI L 
scdl 31 TITATGCACGTCAAAGACAAAGGTGGTCTCGACG ACG7 3TACAACTTCAAGACCTrCGTGCAGAGTATGATCCTG 
rtn TTCATGCAarrGAAGGAGAAGAGCGGCATTAACC^CCrrC^ 

5176 + ♦ ♦ * * + * * 5250 

F M H V K E K S U I I* C V Y N F K T F G Q S M I L 



45 



L F Q 

scdl 31 CTATTTCAGC3!CAGTGTrACTAATCATACTTTAGCGCCTCC 
Dm CTCTTTCAC 

5251 intron T 

L F Q 



50 



scdl31 
scdl 31 



CCAGGC^TTAl'l VXjTTC AAAlTGA AAGT TAA' l ' 1 ' 1 MM 'AGCCGTICAAG<ATCrAGTGTATGCTAATCTO M I V I'I' ATC 
ATCAAACACAGAGTGAGGTTGTTAATTTATGTGTT 
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H civ this and Drosophila sodium channels. — start /end of my sequences, _ gap, " same as above. 3/12/92 p9 

SCpu5285+ (5540WS269+) T3 ♦ TCnGCnGGnTGGGAyGG - > 

MSTSAGWDGVLDGI I NE EECDL 
s cdl 3 1 GTTTGTC&G^TGTCGACCTCNGCCG^ 
Dm ATGTCGACGTCAGCCG<nTGGGATG<jIX7r 

incron T ♦ ♦ ♦ * * ♦ ♦ 5325 

MSTSAGWDGVLDAIINEEACDP 



PDNERGY PG NCGSATIGITY LLSYL 
SCdl 3 1 CCGGACAACGAGCGCGGCTACCCCGGCAACTGCGGCTCTGC^ 
an CCCGACAAC^ACAAAGGCTATCCGGCCAATTGTCCTrCACCGA Ll^ 

5326 + + * -* ♦ ♦ ♦ + 5400 

lt3 PDNDKGY PGNCGSATV GITFLLSV'_ 

IV -S6 

SCpu :>4^5- (5712-) <-7TrTACATTTAdCGnCAgagtgaccagnagcaaa 

visflivinkyiavjle'nysqas* 

scdl 3 1 GTCATCTCCTTCCTCATCGTCATCAACATGTAC^ 

Orr. GTT AT AAGL" 1" \'l'i m i AGTT ATTAAT ATGTACATTGCTGTCATTCTCGAC1AACGGAATTC 
20 5401 * * * *- 5461 

V Z S F 1 I V I N V! Y I A VILE N G I 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: American Cyan amid Company 

(ii) TITLE OF INVENTION: Method for Monitoring Pesticide 
Resistance 

(iii) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: American Cyanamid Company 

(B) STREET: One Cyanamid Plaza 

(C) CITY: Wayne 

(D) STATE: New Jersey 

(E) COUNTRY: USA 

(F) ZIP: 07470-8426 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: EP 93 118 061.6 

(B) FILING DATE: 08-NOV-1993 

_ (C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Wachtershauser Dr., Gunter 

(C) REFERENCE / DOCKET NUMBER: EA- 9088/31, 732 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (089)293906 

(B) TELEFAX: (089)223759 

(C) TELEX: 5214173 Patw-D 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2416 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
AATTCACTAT ACCAGGTAAC TTTTTGATAC CTAGTTTAAA ATAAGATACT GTTGTTATCT 
AATAGGATTT TAAGAGTTGT CATAAACGTA ATGTTAATTT TTCAGGCGAC AATAAATACA 
AGAAAGGGCA AAATTTTGTT AAATAATATT AACGCAWTAA CAGATAATCA TAGAGACAAC 
CGTTTAGACT GTGAATTAAA TCATCACGGG TATCCTATAC AGGTAAATAT TTGTCGTCAC 
AGCTTKCTAA TAAATCACAA TCAAGTTTCT GTACTAAGAA CACAATTTCT CGTTTAGGAT 
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GACGATACAA 


TTAGTCAAAA 


ATCGTACGGC 


AGTCATAAAA 


TCAGGTCGTT 


CAAAGATGAA 


360 




AGTCATAAAG 


GTTCCGCAGA 


CACGATAGAT 


GGCGAMGMGM 


MGAAGGACGC 


TAGTAAAGAG 


420 


5 


GAATTGGGTT 


TAGAAGAAGG 


TCAGTGTAAA 


ACTGCAATTN 


AAAATTAACA 


GAATTGAACT 


480 




AAGC CAT ATT 


TGGACAATTT 


GCATATAATT 


AATGTGTTAC 


AGAAATGGTT 


GAAGAAGAGG 


540 




AAGATGGGAA 


GTTAGACGGA 


GGTCTAGGCA 


AAACAGACAT 


TATAGTGGCC 


GCAGATGAAG 


600 


10 


AAGTTGTTGA 


CGATAGCCCT 


GCTGACTGCT 


GTCCAGAGCC 


ATGTTACGCG 


AAGTTTC CAT 


660 




TCCTTGTGGG 


TGATGATGAA 


TCTCCCTTTT 


GGCAAGGCTG 


GGGCATGCTT 


CGGTTGAAAA 


720 




CCTTCAAACT 


CATTGAGAAC 


ACATATTTCG 


AAACGGCTGT 


GATTACAATG 


ATTTTGCTCA 


780 


15 


GTAGTTTGGC 


TTTGGTAAGT 


TCTCAAATAA 


TTTTCTGAAC 


ACTTTGTTTC 


ACATAGTAAG 


840 




GGAGCAAATT 


ATGTTCATGA 


CGAAACTTYK 


CTGTCTTTAC 


AGGCTTTAGA 


AGATGTAAAT 


900 




TTACCACATC 


GACCGATTCT 


TCAAGATATC 


TTGTATTATA 


TGGATCGGAT 


CTTCACCGTC 


960 


20 


ATTTTCTTCA 


TCGAGATGTT 


GATCAAATGG 


CTTGCCCTTG 


GCTTCCAGAA 


ATACTTCACA 


1020 




AATGCG7GGT 


GCTGGCTCGA 


CTTCATCATT 


GTCATGGTAA 


TATTACTATA 


AATATATTTG 


1080 




CTTT CGTATC 


ATTTGAACTA 


ACAGTTTCCT 


TGCAGATTAG 


ATTGGTAAAA 


CGTAGATTAT 


1140 


25 


GATTATGGAA 


TTTGAACTTG 


TAAGTTCTGT 


ATAATGTGAA 


AGACAAAATT 


AAGGTTCAGG 


1200 


TCGGTCTTTG 


AAGTTTATCC 


TGCCGCCTCT 


CAGCGAGGTA 


AAGCTGGGAA 


GAATAATTTA 


1260 




TACAGTGTTA 


AGTATACCTA 


GATGTAAGGA 


ATATATTGTA 


TACTAAAGTA 


AATGACGATT 


1320 




GGTGTGGCGT 


TAGTTGTCGC 


TCGTAAACCA 


CGGNGCAGTG 


ATGSTGGCGS 


GACGACATCC 


1380 


30 


CNGTTCCGCT 


CGATGCACGT 


TGNGNGCGCT 


GCGGCTCCGC 


GCGGTCTCTC 


GCTGGGAGGG 


1440 




CATGCGCGTG 


AGTAGGACGG 


CACACCACTC 


GTGCGCAGGC 


TGTGTTGGTA 


TCGTTGCGCT 


1500 




GCACATCCAC 


ACGATTGTTT 


CACTCTACTT 


TCTGCTGAGA 


AATCAGTGCA 


ACATGGTGTT 


1560 


35 


GCTAATCGAA 


ATAAGCAACC 


AAACCTTCCG 


ACAGAGATTT 


TTATCTCGAA 


CCACTTTGTG 


1620 




AAATGTGAAC 


TCTGATTCAT 


ATTCAACTAA 


TCTCTTAATA 


AAGTTTGTTG 


TAAATATTTT 


1680 




CTAATTCTAC 


TGTGTTTGAC 


GTGCAGCGCA 


ACTCAAAGCG 


TGCAGCTTTG 


ATTGTTCGAT 


1740 


40 


GTCTATGGCA 


GTGGAAACTC 


CGAACGGCCT 


CACCTCGCTG 


CCTCGAGCTC 


TCGATGTCGT 


1800 




ATTGTTTGTT 


TATGGAAACC 


GCTTCATGTG 


ACTCTATAAC 


CCACGACCCC 


CGCTATATGA 


1860 




ATACCTGTGR 


CCGTATATAT 


AAAAACCTCC 


ACAGAGTGAC 


TTGAAATCCT 


TATACTTTCA 


1920 


45 


AGTGCATGAA 


ACAACACGTC 


TTCTATCTTT 


GTGCTGTTGT 


GCGAGATAGT 


GCGTTTTCAC 


1980 




GTACTACTCA 


CATTACCCAC 


ATCTGTCGGG 


GATAAAATCC 


GASATTTGAA 


AGAAAAGCTT 


2040 




TAAAACTGAA 


AATGGCACGT 


GATGTTGGTT 


GCTGTCGATG 


TCATTACAAA 


GCAAACTATA 


2100 


50 


AATACCTATA 


CTATATACAT 


ATCTTTGATA 


TTTGTTCTTA 


ATATGATGTG 


ATGTAGCTTT 


2160 




ATTTTAGGGA 


CATCAGAGAA 


ACGGTAGCCT 


AAGCTCAAAA 


TTAGAGCTTT 


TTGTAAAATC 


2220 
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AATCCTGTTA ATTGCTATAT AATTATTTCC ATTTCTTTTA TTCTCTGATG KYCYYMAARK 2 2 80 

WAMYTCGATG TAACCTTATG TGTAACTTGA GTGAATATCA CGTTCCTATC CCTCTGATTA 234 0 

TGCTGCAATA GGAACTTCTG TTTCCAAATG AATCTTGAGA TTTTCTTCTT TATAGTATCA 24 0 0 

5 

TATCCTTAGG TTTGTA 2416 

( 2 ) INFORMATION FOR SEQ ID NO : 2 : 

<i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 567 base pairs 

(B) TYPE: nucleic acid 

( C ) STRAND EDNES S : s ingl e 

(D) TOPOLOGY: linear 



75 



25 



30 



35 



40 



(ii) MOLECULE TYPE : DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

ATTAGCGTTC AAAAGCGATG CGAAG CTGGG ACTGCGCTCT CAGGCCATGA GCCGCATGCA 60 

20 GGGCATGAGG GT AC GT AC CA CCCTGTGCTG CCGACAACAC CCTATCGCTC ATCCATCCAC 12 0 

CACACACTTC GCTCCACACT TCACATTCAC ATTT CTATTT CAACTTCTAC GATCATTTTT 180 

TAACATTTTA AAATTTCCAA CGTRCCAGCC GTACTMGGGC TCCTTTTTTC GATATTTCTG 24 0 

- CATSAATCAC CGGATCAAAA TTTGTTTTTA ATAGTTAATT TGGACAGTTA TCCGATTCAT- — 3^0- 

TGGCAGTAGT CGATTGAAGT AATTATTAGT GAATCATTTT GAAGTGGTCG GTGGCACCCC 3 60 

TGAATGGCTT AGTATCATCA CTGTTCGTCA TAAACCT CTT TTAGAAAGGG TCAATGGGAT 420 

TTATTGTGGA GAGATATTYR TCCATGTTTT GGTCTCTTTT CTATTGGTCT TATTATTAGC 480 

TAGATTAGAC TTTTGTAATT ACTTAGTTAT TTGGAATGCT AATTTATATT CTGCACCTTA 540 

GATTTTTTCT T CT T GT AT CT TCATCGA 567 
(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



50 



GCTAACTGCT 


ACATAGTTAC 


TGCACAGTAT 


TAATGACATT 


AACGTCCTTA 


TATCCCAACT 


60 


AATAATGCGC 


CCACTAACAA 


ATGCACGCCA 


TTGATATAAG 


AAAGGAGACG 


TATCAGTACT 


120 


TCCAATATAT 


CCTTCGTGAC 


CAGTGTAGTA 


ATACGTACGT 


ATGTGACAGG 


TGGTGGTAAA 


180 


CGCTCTCGTG 


CAAGCGATCC 


CGTCCATCTT 


CAACGTGTTG 


TTGGTGTGTC 


TTATCTTCTG 


240 
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GCTGATCTTC 


GC CAT CAT GG 


GAGTACAACT 


GTTCGCTGGC 


AAATATTTCA 


AGGTATTAAT 


300 




TTATTAACAT 


AACAAAAAAA 


TATTTCAATT 


CGTAAAATCT 


TATTAGTGTG 


TTCAAAATTT 


360 


5 


CTAACATGTT 


TTTCTTTGTT 


CTGTTCTAGT 


GCGTCGACCT 


CAACCACACG 


ACGTTGAGCC 


420 




ACGAAATCAT 


CCCAGACCGG 


AATGCGTGCA 


T CTT AGAG AA 


CTACACCTGG 


GAGAACTCAC 


480 




CGATGAACTT 


TGACCATGTC 


GGCAAGGCGT 


ATCTCTGCCT 


GTTCCAAGTG 


GCCACCTTCA 


540 


70 


AGGGATGGAT 


ACAGATCATG 


AACGACGCTA 


TTGATTCGAG 


AGAAGTATGG 


CTACTATTTC 


600 




TTTTCCTTTT 


GTTCATAAGT 


TCATAAATTA 


ATATCAATAA 


AAATATCACG 


CAATACAATA 


660 




AATGATATTG 


TTAATGCCAG 


GTGGGCCGGC 


AACCTATACG 


CGAGACGAAC 


ATCTACATGT 


720 


75 


ACCTGTACTT 


CGTGTTCTTC 


ATCATATTTG 


GCTCATTCTT 


CACTCTCAAC 


CTATTCATCG 


780 




GTGTGATCAT 


CGACAACTTT 


AACGAACAGA 


AGAAGAAAGC 


CGGCGGCAGC 


CTTGAGATGT 


840 




T CAT G ACT G A 


GGACCAGAAG 


AAATACTACA 


ATGCCATGAA 


GAAAATGGGT 


TCTAAAAAAC 


900 


20 


CTTTAAAAGC 


TATCCCGAGA 


CCGAAGGTAA 


CAGACGATTG 


CATT GTTTT T 


TGACCTCAAT 


960 


GGAAACATAT 


CCAAGGAGGA 


GCGAGTCTTA 


TATTTGAAAC 


TTGATAGTAA 


TATTGTTGTA 


1020 




TATTTTATAA 


TTTCATAAAC 


AGCAGTACTG 


CGGTAAACCA 


TTGTTTTCAA 


CGCCAGAAAC 


1080 


"2S~ 


TGCAGGACGT 


TTAATTATTG 


AGGGATGATT 


TTGCCTAGAA 


TCTATTCTAA 


GATTGATTTG 


1140 


GAGCCGTCCA 


CTTCCCAACG 


ACAGTTGCAG 


CATCTATGCC 


ACCGGACCAC 


GTCGTTGTAC 


1200 




CCAGATAAGA 


AAGCTTTCTA 


CCTAAATAAA 


CACTAACTGA 


AACTGTTTGT 


TCCAGTGGCG 


1260 




GCCACAAGCG 


ATCGTGTTCG 


AGATAGTGAC 


GGACAAGAAG 


TTCGACATGA 


TCATCATGTT 


1320 


30 


GTTCATCGGC 


CTCAACATGT 


TGACGATGAC 


GCTCGATCAC 


TACCAGCAGT 


CGGAGACCTT 


1380 




CAGCACTGTC 


CTCGACTACC 


TCAACATGAT 


ATTCATCGTG 


ATATTCAGTT 


CAGAGTGCCT 


1440 




ATTAAAAATG 


TTCGCCTTAC 


GCTACCATTA 


CTTTGTTGAG 


CCATGGAACT 


TGTTCGATTT 


1500 


35 


CGTAGTAGTC 


AATTTCTCAA 


TT CTTAGTGA 


GTATTTTGGG 


TCTCCTGTTA 


TTCCAATAGT 


1560 




AAAGTGTTTT 


CCATTTATAA 


TTTACTAATG 


ATACACTCTC 


TTTGTTCTCA 


GGTTTGGTAT 


1620 




TGAGTGATAT 


TATAGAAAAA 


TATTTTGTGT 


CACCCACGTT 


ACTGAGGGTG 


GTGAGAGTAG 


1680 


40 


CGAAGGTCGG 


TCGTGTGTTG 


CGTCTCGTGA 


AGGGTGCGAA 


GGGTATCCGG 


ACGTTATTGT 


1740 




TCGGGCTGGC 


CATGTCACTG 


CCAGCCTTAT 


TCAACATCTG 


TCTGCTGCTG 


TTCCTTGTGA 


1800 




TGTTCATCTT 


CGCCATCTTC 


GGCATGTCGT 


TCTTTATGCA 


CGTCAAAGAC 


AAAGGTGGTC 


1860 


45 


TCGACGACGT 


GTACAACTTC 


AAGACCTTCG 


TGCAGAGTAT 


GATCCTGCTA 


TTTCAGGTCA 


1920 




GTGTTACTAA 


TCATACTTTA 


GCGCCTCCTG 


GTTGCTTGAG 


GATGAATGAC 


CACAAGCAAC 


1980 




CAGCAGGGTT 


TATTCGTTCA 


AATTGAAAGT 


TAATTTTTAG 


CCGTTCAAGC 


ATCTAGTGTA 


2040 


50 


TGCTAATCTG 


TCTTATCGTT 


TGTCAGATGT 


CGACGTCNGC 


CGGCTGGGAC 


GGCGT GCTGG 


2100 




ACGGCATCAT 


CAACGAGGAG 


GAGTGCGAKC 


TGCCGGACAA 


CGAGCGCGGC 


TACCCCGGCA 


2160 
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ACTGCGGCTC TGCNACCATC GGCATCACCT ACCTGCTGTC CTACCTCGTC ATCTCCTTCC 222 0 

TCATCGTCAT CAACATGTAC ATCGCCGTCA TTCTCGAGAA TTACTCGCAG GCAAGTTGA 227 9 

5 (2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
1Q (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



75 



2C 



25 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Asp Asp Asp Thr lie Ser Gin Lys Ser Tyr Gly Ser His Lys lie Arg 
15 10 15 

Ser Phe Lys Asp Glu Ser His Lys Gly Ser Ala Asp Thr lie Asp Gly 
20 25 30 

Xaa Xaa Xaa Lys Asp Ala Ser Lys Glu Glu Leu Gly Leu Glu Glu Glu 

35 40 45 

Met Val Glu Glu Glu Glu Asp Gly Lys Leu Asp Gly Gly Leu Gly Lys 
50 55 60 



Thr Asp lie lie Val Ala Ala Asp Glu Glu Val Val Asp Asp Ser" Pro 
65 70 75 80 

Ala Asp Cys Cys Pro Glu Pro Cys Tyr Ala Lys Phe Pro Phe Leu Val 
85 90 95 

30 G iy Asp Asp Glu Ser Pro Phe Trp Gin Gly Trp Gly Met Leu Arg Leu 

100 105 110 

Lys Thr Phe Lys Leu lie Glu Asn Thr Tyr Phe Glu Thr Ala Val lie 
115 120 125 



Thr Met lie Leu Leu Ser Ser Leu Ala Leu Ala Leu Glu Asp Val Asn 
130 135 140 

Leu Pro His Arg Pro lie Leu Gin Asp lie Leu Tyr Tyr Met Asp Arg 
145 150 155 160 

lie Phe Thr Val lie Phe Phe lie Glu Met Leu lie Lys Trp Leu Ala 
165 170 175 

Leu Gly Phe Gin Lys Tyr Phe Thr Asn Ala Trp Cys Trp Leu Asp Phe 
180 185 190 

lie lie Val Met 
45 195 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
50 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Ala Met Ser Arg Met Gin Gly Met Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 6: 

70 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 amino acids 

(B) TYPE : amino acid 

(C) STRAND EDNES S : single 

(D) TOPOLOGY: linear 

75 (ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Val Val Val Asn Ala Leu Val Gin Ala lie Pro Ser He Phe Asn Val 
15 10 15 

Leu Leu Val Cys Leu He Phe Trp Leu He Phe Ala lie Met Gly Val 
20 25 30 

Gin Leu Phe Ala Gly Lys Tyr Phe Lys Cys Val Asp Leu Asn His Thr 

~25 — - - 35 - 40 - - 45 

Thr Leu Ser His Glu lie He Pro Asp Arg Asn Ala Cys He Leu Glu 
50 55 60 

Asn Tyr Thr Trp Glu Asn Ser Pro Met Asn Phe Asp His Val Gly Lys 
30 65 70 75 80 

Ala Tyr Leu Cys Leu Phe Gin Val Ala Thr Phe Lys Gly Trp He Gin 
85 90 95 



35 



40 



45 



50 



He Met Asn Asp Ala He Asp Ser Arg Glu Val Gly Arg Gin Pro He 
100 105 110 

Arg Glu Thr Asn He Tyr Met Tyr Leu Tyr Phe Val Phe Phe lie He 
115 120 125 

Phe Gly Ser Phe Phe Thr Leu Asn Leu Phe He Gly Val He He Asp 
130 135 140 

Asn Phe Asn Glu Gin Lys Lys Lys Ala Ala Gly Ser Leu Glu Met Phe 
145 150 155 160 

Met Thr Glu Asp Gin Lys Lys Tyr Tyr Asn Ala Met Lys Lys Met Gly 
165 170 175 

Ser Lys Lys Pro Leu Lys Ala He Pro Arg Pro Lys Trp Arg Pro Gin 
180 185 190 

Ala He Val Phe Glu He Val Thr Asp Lys Lys Phe Asp Met He He 
195 200 205 

Met Leu Phe He Gly Leu Asn Met Leu Thr Met Thr Leu Asp His Tyr 
210 215 220 
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10 



15 



Gin Gin Ser Glu Thr Phe Ser Thr Val Leu Asp Tyr Leu Asn Met lie 
225 230 235 240 

Phe lie Val lie Phe Ser Ser Glu Cys Leu Leu Lye Met Phe Ala Leu 
245 250 255 

Arg Tyr His Tyr Phe Val Glu Pro Trp Asn Leu Phe Asp Phe Val Val 
260 265 270 

Val Asn Phe Ser lie Leu Ser Leu Val Leu Ser Asp lie lie Glu Lys 
275 280 285 

Tyr Phe Val Ser Pro Thr Leu Leu Arg Val Val Arg Val Ala Lys Val 
290 295 300 

Gly Arg Val Leu Arg Leu Val Lys Gly Ala Lys Gly lie Arg Thr Leu 
305 310 315 320 

Leu Phe Gly Leu Ala Met Ser Leu Pro Ala Leu Phe Asn lie Cys Leu 
325 330 335 

Leu Leu Phe Leu Val Met Phe lie Phe Ala lie Phe Gly Met Ser Phe 
340 345 350 

Phe Met His Val Lys Asp Lys Gly Gly Leu Asp Asp Val Tyr Asn Phe 
355 360 365 

Lys Thr Phe Val Gin Ser Met lie Leu Leu Phe Gin Met Ser Thr Ser 
370 375 380 

Ala Gly Trp Asp Gly Val Leu Asp~Gly lie ~He Asn Glu Glu Glu Cys 
385 390 395 400 

Asp Leu Pro Asp Asn Glu Arg Gly Tyr Pro Gly Asn Cys Gly Ser Ala 
405 410 415 

30 Thr He Gly He Thr Tyr Leu Leu Ser Tyr Leu Ala Ala Val lie Ser 

420 425 430 

Phe Leu lie Val He Asn Met Tyr He Ala Val He Leu Glu Asn Tyr 
435 440 445 



20 



"25 



35 



40 



45 



50 



Ser Gin Ala Ser 
450 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5461 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

ATGACAGAAG ATTCCGACTC GATATCTGAG GAAGAACGCA GTTTGTTCCG TCCCTTTACC 60 

CGCGAATCAT TGGTGCAAAT CGAACAACGC ATTGCCGCTG AACATGAAAA GCAGAAGGAG 12 0 

CTGGAAAGAA AGAGAGCCGA GGGAGAGGTG CCGCGATATG GTCGCAAGAA AAAACAAAAA 180 
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GAAATCCGAT 


ATGATGACGA 


GGACGAGGAT 


GAAGGTCCAC 


AACCGGATCC 


TACACTTGAA 


240 




CAGGGTGTGC 


CAATACCTGT 


TCGATTGCAG 


GGCAGCTTCC 


CGCCGGAATT 


GGCCTCCACT 


300 


5 


CCTCTCGAGG 


ATATCGATCC 


CTACTACAGC 


AATGTACTGA 


CATTCGTAGT 


TGTAAGCAAA 


360 




GGAAAAGATA 


TTTTTCGCTT 


TTCTGCATCA 


AAAGCAATGT 


GGATGCTCGA 


TCCATTCAAT 


420 




CCGATACGTC 


GTGTGGCCAT 


TTACATTCTA 


GTGCATCCAT 


TATTTTCCCT 


ATTCATCATC 


480 


10 


ACCACAATTC 


T CGT CAACTG 


CATCCTGATG 


ATAATGCCGA 


CAACGCCCAC 


GGTTGAGTCC 


540 




ACTGAGGTGA 


TATTCACCGG 


AATCTACACA 


TTTGAATCAG 


CTGTTAAAGT 


GATGGCACGA 


600 




GGTTTCATTT 


TATGCCCGTT 


TACGTATCTT 


AGAGATGCAT 


GGAATTGGCT 


GGACTTCGTA 


660 


15 


GTAATAGCTT 


TAGCTTATGT 


GACCATGGGT 


ATAGATTTAG 


GTAATCTAGC 


AGCCCTGCGA 


720 




ACGTTTAGGG 


TGCTGCGAGC 


GCTTAAAACC 


GTAGCCATTG 


TGCCAGGCTT 


GAAGACCATC 


780 




GTCGGCGCCG 


TCATCGAATC 


GGTGAAGAAT 


CTGCGCGATG 


TGATTATCCT 


GACCATGTTC 


840 


20 


TCCCTGTCGG 


TGTTCGCGTT 


GATGGGCCTA 


CAGATCTATA 


TGGGCGTGCT 


CACCGAGAAG 


900 


TGCATCAAGA 


AGTTCCCGCT 


GGACGGTTCC 


TGGGGCAATC 


TGACCGACGA 


GAACTGGGAC 


960 




TATCACAATC 


GCAATAGCTC 


CAATTGGTAT 


TCCGAGGACG 


AGGGCATCTC 


ATTTCCGTTA 


1020 


25 


TGCGGCAATA 


TATCCGGTGC 


GGGGCAATGC 


GACGACGATT 


ACGTGTGCCT 


GCAGGGGTTT 


1080 


GGTCCGAATC 


CGAATTATGG 


CTACACCAGC 


TTCGATTCGT 


TCGGATGGGC 


TTTCCTGTCC 


1140 




GCCTTCCGGC 


TGATGACACA 


GGACTTCTGG 


GAGGATCTGT 


ACCAGCTGGT 


GTTGCGCGCC 


1200 




GCCGGACCAT 


GGCACATGCT 


GTTCTTTATA 


GTCATCATCT 


TCCTAGGTTC 


ATTCTATCTT 


1260 


30 


GTGAATTTGA 


TTTTGGCCAT 


TGTTGCCATG 


TCGTATGACG 


AATTGCAAAG 


GAAGGCCGAA 


1320 




GAAGAAGAGG 


CTGCCGAAGA 


GGAGGC GAT A 


CGTGAAGCGG 


AAGAAGCTGC 


CGCCGCCAAA 


1380 




GCGGCCAAGC 


TGGAGGAGCG 


GGCCAATGCG 


CAGGCTCAGG 


CAGCAGCGGA 


TGCGGCTGCC 


1440 


35 


GCCGAAGAGG 


CTGCACTGCA 


TCCGGAAATG 


GCCAAGAGTC 


CGACGTATTC 


TTGCATCAGC 


1500 




TATGAGCTAT 


TTGTTGGCGG 


CGAGAAGGGC 


AACGATGACA 


ACAACAAAGA 


GAAGATGTCC 


1560 




ATTCGGAGCG 


TCGAGGTGGA 


GTCGGAGTCG 


GTGAGCGTTA 


TACAAAGACA 


ACCAGCACCT 


1620 


40 


ACCACAGCAC 


ACCAAGCTAC 


CAAAGTTCGT 


AAAGTGAGCA 


CGTACACGAT 


ACGGAACGGA 


1680 




CGTGGCCGCT 


TTGGTATACC 


CGGTAGCGAT 


CGTAAGCCAT 


TGGTATTGTC 


AACAT AT CAG 


1740 










uAU X W w#W% X w 


CCGTCACCCC 


GATGTCCGAA 


1800 


45 


GAGAATGGGG 


CCATCATAGT 


GCCCGTGTAC 


TATGGCAATC 


TAGGCTCCCG 


ACACTCATCG 


1860 




TATACCTCGC 


ATCAGTCCCG 


AAT AT CGT AT 


ACCTCACATG 


GCGATCTACT 


CGGCGGCATG 


1920 




GCCGTCATGG 


GCGTCAGCAC 


AATGACCAAG 


GAGAGCAAAT 


TGCGCAACCG 


CAACACACGC 


1980 


50 


AATCAATCAG 


TGGGCGCCAC 


CAATGGCGGC 


ACCACCTGTC 


TGGACACCAA 


TCACAAGCTC 


2040 




GATCAT CGCG 


ACTACGAAAT 


TGGCCTGGAG 


TGCACGGACG 


AAGCTGGCAA 


GAT T AAA CAT 


2100 
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CATGACAATC 


CTTTTATCGA 


GCCCGTCCAG 


ACACAAACGG 


TGGTTGATAT 


GAAAGATGTG 


2160 




ATGGTCCTGA 


ATGACATCAT 


CGAACAGGCC 


GCTGGTCGGC 


ACAGTCGGGC 


AAGCGATCGC 


2220 


5 


GGTGTCTCCG 


TTTACTATTT 


CCCAACAGAG 


GACGATGACG 


AGGATGGGCC 


GACGTTCAAA 


2280 




GACAAGGCAC 


TCGAAGTGAT 


CCTCAAAGGC 


ATCGATGTGT 


TTTGTGTGTG 


GGACTGTTGC 


2340 




TGGGTTTGGT 


TGAAATTTCA 


GGAGTGGGTA 


TCGCTCATCG 


TCTTCGATCC 


CTTCGTCGAG 


2400 


10 


CTCTTCATCA 


CGCTGTGCAT 


TGTGGTCAAC 


ACGATGTTCA 


TGGCAATGGA 


TCACCACGAT 


2460 




ATGAACAAGG 


AGATGGAACG 


CGTGCTCAAG 


AGTGGCAACT 


ATTTCTTCAC 


CGCCACCTTT 


2520 




GCCATCGAGG 


CCACCATGAA 


GCTAATGGCC 


ATGAGCCCCA 


AGTACTATTT 


CCAGGAGGGC 


2580 


15 


TGGAACATCT 


TCGACTTCAT 


TATCGTGGCC 


CTATCGCTAT 


TGGAACTGGG 


ACTCGAGGGT 


2640 




GTCCAGGGTC 


TGTCCGTATT 


GCGTTCCTTT 


CGATTGCTGC 


GTGTATTCAA 


ACTGGCCAAG 


2700 




TCTTGGCCCA 


CACTTAATTT 


ACTCATTTCG 


ATTATGGGAC 


GCACCATGGG 


CGCTTTGGGT 


2760 


2C 


AATCTGACAT 


TTGTACTTTG 


CATT AT CAT C 


TTCATCTTTG 


CGGTGATGGG 


AATGCAACTG 


2820 


TTCGGAAAGA 


ATTATCATGA 


T CACAAGG AC 


CGCTTTCCGG 


ATGGCGACCT 


GCCGCGCTGG 


2880 




AACTTCACCG 


ACTTTATGCA 


CAGCTTCATG 


ATCGTGTTCC 


GGGTGCTCTG 


CGGAGAATGG 


2940 




ATCGAGTCCA 


TGTGGGACTG 


CATGTACGTG 


GGCGATGTCT 


CGTGCATTCC 


CTTCTTCTTG 


3000 


■ PC 


GCCACCGTTG 


TCATCGGCAA 


TCTTGTGGTA 


CTTAACCTTT 


TCTTAGCCTT 


GCTTTTGTCC 


3060 




AATTTTGGCT 


CATCTAGCTT 


ATCAGCGCCG 


ACTGCCGATA 


ACGATACGAA 


TAAAATAGCC 


3120 




GAGGCCTTCA 


ATCGAATTGG 


CCGATTTAAA 


AGTTGGGTTA 


AGCGTAATAT 


TGCTGATTGT 


3180 


30 


TT CAAGTT AA 


TACGTAACAA 


ATTGACAAAT 


CAAATAAGTG 


ATCAACCATC 


AGAGCATGGT 


3240 




GACAACGAAC 


TGGAGCTGGG 


CCACGACGAG 


ATCCTCGCCG 


ACGGCCTCAT 


CAAGAAGGGG 


3300 




ATCAAGGAGC 


AGACGCAACT 


GGAGGTGGCC 


ATCGGGGATG 


GCATGGAATT 


CACGATACAC 


3360 


35 


GGCGACATGA 


AGAACAACAA 


GCCGAAGAAA 


TCCAAATATC 


TAAATAACGC 


AACGGACGAC 


3420 




GACACTGCCA 


GCATTAACTC 


ATATGGTAGC 


CATAAGAATC 


GACCATTCAA 


GGACGAGAGC 


3480 




CACAAGGGCA 


GCGCCGAGAC 


GATGGAGGGC 


GAGGAGAAGC 


GCGACGCCAG 


CAAGGAGGAT 


3540 


40 


TTAGGTCTCG 


ACGAGGAACT 


GGACGAGGAG 


GGCGAATGCG 


AGGAGGGCCC 


GCTCGACGGT 


3600 




GATATCATTA 


TTCATGCACA 


CGACGAGGAT 


ATACTCGATG 


AATATCCAGC 


TGATTGCTGC 


3660 




CCCGATTCGT 


ACTATAAGAA 


ATTTCCGATC 


TTAGCCGGTG 


ACGATGACTC 


GCCGTTCTGG 


3720 


45 


CAAGGATGGG 


GCAATTTACG 


ACTGAAAACT 


TTTCGATTAA 


TTGAGGATAA 


ATATTTTGAA 


3780 




ACAGCTGTTA 


TCACTATGAT 


TTTAATGAGT 


AGCTTAGCTT 


TGGCATTAGA 


AGATGTACAT 


3840 




CTGCCACAAA 


GAC C CAT ACT 


GCAGGATATT 


TTATACTATA 


TGGACAGAAT 


ATTTACGGTT 


3900 


50 


ATATTCTTCT 


TGGAAATGTT 


AATCAAGTGG 


TTGGCGCTCG 


GCTTCAAAGT 


GTACTTGACC 


3960 




AACGCGTGGT 


GTTGGCTCGA 


TTTCGTGATT 


GTCATGGTAT 


CGCTTATCAA 


CTTCGTTGCT 


4020 
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TCACTTGTTG 


GAGCTGGTGG 


TATTCAAGCC 


TTCAAGACTA 


TGCGAACGTT 


AAGAGCACTG 


4080 




AGACCACTAC 


GTGCCATGTC 


CCGTATGCAG 


GGCATGAGGG 


TCGTCGTTAA 


TGCGCTGGTA 


4140 


5 


CAAGCTATAC 


CGTCCATCTT 


CAATGTGCTA 


TTGGTGTGTC 


TAATATTTTG 


G CT AATTTTT 


4200 




GCCATAATGG 


GTGTACAGCT 


TTTTGCTGGA 


AAATATTTTA 


AGTGCGAGGA 


CATGAATGGC 


4260 




ACGAAGCTCA 


GCCACGAGAT 


CATACCAAAT 


CGCAATGCCT 


GCGAGAGCGA 


GAACTACACG 


4320 


70 


TGGGTGAATT 


CAGCAATGAA 


TTTCGATCAT 


GTAGGTAACG 


CGTATCTGTG 


CCTTTTCCAA 


4380 




GTGGCCACCT 


TCAAAGGCTG 


G AT ACAAAT C 


ATGAACGATG 


CT AT CGATTC 


ACGAGAGGTG 


4440 




GACAAGCAAC 


CAATTCGTGA 


AACGAACATC 


TACATGTATT 


TATATTTCGT 


ATT CTT CATC 


4500 


75 


ATATTTGGAT 


CATTTTTCAC 


ACTCAATCTG 


TTCATTGGTG 


TTATCATTGA 


TAATTTTAAT 


4560 




GAGCAAAAGA 


AAAAAGCAGG 


TGGATCATTA 


GAAATGTTCA 


TGACAGAAGA 


TCAGAAAAAG 


4620 




TACTATAGTG 


CTATGAAAAA 


GATGGGCTCT 


AAAAAACCAT 


TAAAAGCCAT 


TCCAAGACCA 


4680 


20 


AGGTGGCGAC 


CACAAGCAAT 


AGTCTTTGAA 


ATAGTAACCG 


ATAAGAAATT 


CGATATAATC 


4740 


ATTATGTTAT 


TCATTGGTCT 


GAACATGTTC 


ACCATGACCC 


TCGATCGTTA 


CGATGCGTCG 


4800 




GACACGTATA 


ACGCGGTCCT 


AG AC T AT CT C 


AATGCGATAT 


TCGTAGTTAT 


TTTCAGTTCC 


4860 


25 


GAATGTCTAT 


TAAAAATATT 


CGCTTTACGA 


TATCACTATT 


TTATTGAGCC 


ATGGAATTTA 


4920 


TTTGATGTAG 


TAGTTGTCAT 


TTTATCCATC 


TTAGGTCTTG 


TACTTAGCGA 


TATTATCGAG 


4980 




AAGTACTTCG 


TGTCGCCGAC 


CCTGCTCCGA 


GTGGTGCGTG 


TGGCGAAAGT 


GGGCCGTGTC 


5040 




CTTCGACTGG 


TGAAGGGAGC 


CAAGGGCATT 


CGGACACTGC 


TCTTCGCGTT 


GGCCATGTCG 


5100 


30 


CTGCCGGCCC 


TGTTCAACAT 


CTGCCTGCTG 


CTGTTCCTGG 


TCATGTTCAT 


CTTTGCCATT 


5160 




TTCGGCATGT 


CGTTCTTCAT 


GCACGTGAAG 


GAGAAGAGCG 


GCATTAACGA 


CGTCTACAAC 


5220 




TTCAAGACCT 


TTGGCCAGAG 


CATGATCCTG 


CTCTTTCAGA 


TGTCGACGTC 


AGCCGGTTGG 


5280 


35 


GATGGTGTAC 


TGGACGCCAT 


TATCAATGAG 


GAAGCATGCG 


ATCCACCCGA 


CAACGACAAA 


5340 




GGCTATCCGG 


GCAATTGTGG 


TTCAGCGACC 


GTTGGAATAA 


CGTTTCTCCT 


CTCATACCTA 


5400 




GTTATAAGCT 


TTTTGATAGT 


TATTAATATG 


TACATTGCTG 


TCATTCTCGA 


GAACGGAATT 


5460 


40 


C 












5461 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 0 amino acids 
45 (B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

50 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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Met Thr Glu Asp Ser Asp Ser lie Ser Glu Glu Glu Arg Ser Leu Phe 
15 10 15 

Arg Pro Phe Thr Arg Glu Ser Leu Val Gin lie Glu Gin Arg lie Ala 
20 25 30 

Ala Glu Hie Glu Lys Gin Lys Glu Leu Glu Arg Lys Arg Ala Glu Gly 
35 40 45 

Glu Val Pro Arg Tyr Gly Arg Lys Lys Lys Gin Lys Glu lie Arg Tyr 
50 55 60 

Asp Asp Glu Asp Glu Asp Glu Gly Pro Gin Pro Asp Pro Thr Leu Glu 
65 70 75 80 

Gin Gly Val Pro lie Pro Val Arg Leu Gin Gly Ser Phe Pro Pro Glu 
85 90 95 

Leu Ala Ser Thr Pro Leu Glu Asp He Asp Pro Tyr Tyr Ser Asn Val 
100 105 110 

Leu Thr Phe Val Val Val Ser Lys Gly Lys Asp He Phe Arg Phe Ser 
115 120 125 

Ala Ser Lys Ala Met Trp Met Leu Asp Pro Phe Asn Pro lie Arg Arg 
130 135 140 

Val Ala He Tyr He Leu Val His Pro Leu Phe Ser Leu Phe He lie 
145 150 155 160 



Thr Thr He Leu Val Asn Cys lie Leu Met He Met Pro Thr Thr Pro 
165 170 175 

Thr Val Glu Ser Thr Glu Val He Phe Thr Gly He Tyr Thr Phe Glu 
180 185 190 

Ser Ala Val Lys Val Met Ala Arg Gly Phe He Leu Cys Pro Phe Thr 
195 200 205 

Tyr Leu Arg Asp Ala Trp Asn Trp Leu Asp Phe Val Val He Ala Leu 
210 215 220 

Ala Tyr Val Thr Met Gly He Asp Leu Gly Asn Leu Ala Ala Leu Arg 
225 230 235 240 

Thr Phe Arg Val Leu Arg Ala Leu Lys Thr Val Ala He Val Pro Gly 
245 250 255 

Leu Lys Thr lie Val Gly Ala Val He Glu Ser Val Lys Asn Leu Arg 
260 265 270 

Asp Val He He Leu Thr Met Phe Ser Leu Ser Val Phe Ala Leu Met 
275 280 285 

Gly Leu Gin He Tyr Met Gly Val Leu Thr Glu Lys Cys He Lys Lys 
290 295 300 

Phe Pro Leu Asp Gly Ser Trp Gly Asn Leu Thr Asp Glu Asn Trp Asp 
305 310 315 320 

Tyr His Asn Arg Asn Ser Ser Asn Trp Tyr Ser Glu Asp Glu Gly He 
325 330 335 

Ser Phe Pro Leu Cys Gly Asn He Ser Gly Ala Gly Gin Cys Asp Asp 
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340 345 350 

Asp Tyr Val Cys Leu Gin Gly Phe Gly Pro Asn Pro Asn Tyr Gly Tyr 
355 360 365 

Thr Ser Phe Asp Ser Phe Gly Trp Ala Phe Leu Ser Ala Phe Arg Leu 
370 375 380 

Met Thr Gin Asp Phe Trp Glu Asp Leu Tyr Gin Leu Val Leu Arg Ala 
385 390 395 400 

Ala Gly Pro Trp His Met Leu Phe Phe lie Val He He Phe Leu Gly 
405 410 415 

Ser Phe Tyr Leu Val Asn Leu lie Leu Ala lie Val Ala Met Ser Tyr 
420 425 430 

Asp Glu Leu Gin Arg Lys Ala Glu Glu Glu Glu Ala Ala Glu Glu Glu 
435 440 445 

Ala lie Arg Glu Ala Glu Glu Ala Ala Ala Ala Lys Ala Ala Lys Leu 
450 455 460 

Glu Glu Arg Ala Asn Ala Gin Ala Gin Ala Ala Ala Asp Ala Ala Ala 
465 470 475 480 

Ala Glu Glu Ala Ala Leu His Pro Glu Met Ala Lys Ser Pro Thr Tyr 
485 490 495 

fi er gy a He Ser Tyr Glu Leu Phe Val Gly Gly Glu Lys Gly_Asn Asp_ 
500 505 510 

Asp Asn Asn Lys Glu Lys Met Ser lie Arg Ser Val Glu Val Glu Ser 
515 520 525 

Glu Ser Val Ser Val He Gin Arg Gin Pro Ala Pro Thr Thr Ala His 
530 535 540 

Gin Ala Thr Lys Val Arg Lys Val Ser Thr Tyr Thr He Arg Asn Gly 
545 550 555 560 

Arg Gly Arg Phe Gly He Pro Gly Ser Asp Arg Lys Pro Leu Val Leu 
565 570 575 

Ser Thr Tyr Gin Asp Ala Gin Gin His Leu Pro Tyr Ala Asp Asp Ser 
580 585 590 

Asn Ala Val Thr Pro Met Ser Glu Glu Asn Gly Ala He He Val Pro 
595 600 605 

Val Tyr Tyr Gly Asn Leu Gly Ser Arg His Ser Ser Tyr Thr Ser His 
610 615 620 

Gin Ser Arg lie Ser Tyr Thr Ser His Gly Asp Leu Leu Gly Gly Met 
625 630 635 640 

Ala Val Met Gly Val Ser Thr Met Thr Lys Glu Ser Lys Leu Arg Asn 
645 650 655 

Arg Asn Thr Arg Asn Gin Ser Val Gly Ala Thr Asn Gly Gly Thr Thr 
660 665 670 

Cys Leu Asp Thr Asn His Lys Leu Asp His Arg Asp Tyr Glu He Gly 
675 680 685 
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Leu Glu Cys Thr Asp Glu Ala Gly Lys lie Lys His His Asp Asn Pro 
690 695 700 

Phe lie Glu Pro Val Gin Thr Gin Thr Val Val Asp Met Lys Asp Val 
705 710 715 720 

Met Val Leu Asn Asp lie lie Glu Gin Ala Ala Gly Arg His Ser Arg 
725 730 735 

Ala Ser Asp Arg Gly Val Ser Val Tyr Tyr Phe Pro Thr Glu Asp Asp 
740 745 750 

Asp Glu Asp Gly Pro Thr Phe Lys Asp Lys Ala Leu Glu Val lie Leu 
755 760 765 

Lys Gly He Asp Val Phe Cys Val Trp Asp Cys Cys Trp Val Trp Leu 
770 775 780 

Lys Phe Gin Glu Trp Val Ser Leu He Val Phe Asp Pro Phe Val Glu 
785 790 795 800 

Leu Phe lie Thr Leu Cys He Val Val Asn Thr Met Phe Met Ala Met 
805 810 815 

Asp His His Asp Met Asn Lys Glu Met Glu Arg Val Leu Lys Ser Gly 
820 825 830 

Asn Tyr Phe Phe Thr Ala Thr Phe Ala He Glu Ala Thr Met Lys Leu 
835 840 845 



Met Ala Met Ser Pro Lys Tyr Tyr Phe Gin Glu Gly Trp Asn He Phe 
850 855 860 

Asp Phe He He Val Ala Leu Ser Leu Leu Glu Leu Gly Leu Glu Gly 
865 870 875 880 

Val Gin Gly Leu Ser Val Leu Arg Ser Phe Arg Leu Leu Arg Val Phe 
885 890 895 

Lys Leu Ala Lys Ser Trp Pro Thr Leu Asn Leu Leu He Ser He Met 
900 905 910 

Gly Arg Thr Met Gly Ala Leu Gly Asn Leu Thr Phe Val Leu Cys He 
915 920 925 

He He Phe He Phe Ala Val Met Gly Met Gin Leu Phe Gly Lys Asn 
930 935 940 

Tyr His Asp His Lys Asp Arg Phe Pro Asp Gly Asp Leu Pro Arg Trp 
945 950 955 960 

Asn Phe Thr Asp Phe Met His Ser Phe Met He Val Phe Arg Val Leu 
965 970 975 

Cys Gly Glu Trp He Glu Ser Met Trp Asp Cys Met Tyr Val Gly Asp 
980 985 990 

Val Ser Cys He Pro Phe Phe Leu Ala Thr Val Val He Gly Asn Leu 
995 1000 1005 

Val Val Leu Asn Leu Phe Leu Ala Leu Leu Leu Ser Asn Phe Gly Ser 
1010 1015 1020 

Ser Ser Leu Ser Ala Pro Thr Ala Asp Asn Asp Thr Asn Lys He Ala 
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1025 1030 1035 1040 

Glu Ala Phe Asn Arg lie Gly Arg Phe Lys Ser Trp Val Lys Arg Asn 
1045 1050 1055 

lie Ala Asp Cys Phe Lys Leu lie Arg Asn Lys Leu Thr Asn Gin lie 
1060 1065 1070 

Ser Asp Gin Pro Ser Glu His Gly Asp Asn Glu Leu Glu Leu Gly His 
1075 1080 1085 

Asp Glu lie Leu Ala Asp Gly Leu lie Lys Lys Gly lie Lys Glu Gin 
1090 1095 1100 

Thr Gin Leu Glu Val Ala lie Gly Asp Gly Met Glu Phe Thr lie His 
1105 1110 1115 1120 

Gly Asp Met Lys Asn Asn Lys Pro Lys Lys Ser Lys Tyr Leu Asn Asn 
1125 1130 1135 

Ala Thr Asp Asp Asp Thr Ala Ser lie Asn Ser Tyr Gly Ser His Lys 
1140 1145 1150 

Asn Arg Pro Phe Lys Asp Glu Ser His Lys Gly Ser Ala Glu Thr Met 
1155 1160 1165 

Glu Gly Glu Glu Lys Arg Asp Ala Ser Lys Glu Asp Leu Gly Leu Asp 
1170 1175 1180 

Glu Glu Leu Asp Glu Glu Gly Glu Cys Glu Glu Gly Pro Leu Asp Gly 
1185 1190 1195 1200 

Asp lie He He His Ala His Asp Glu Asp He Leu Asp Glu Tyr Pro 
1205 1210 1215 

Ala Asp Cys Cys Pro Asp Ser Tyr Tyr Lys Lys Phe Pro He Leu Ala 
1220 1225 1230 

Gly Asp Asp Asp Ser Pro Phe Trp Gin Gly Trp Gly Asn Leu Arg Leu 
1235 1240 1245 

Lys Thr Phe Arg Leu lie Glu Asp Lys Tyr Phe Glu Thr Ala Val He 
1250 1255 1260 

Thr Met He Leu Met Ser Ser Leu Ala Leu Ala Leu Glu Asp Val His 
1265 1270 1275 1280 

Leu Pro Gin Arg Pro He Leu Gin Asp He Leu Tyr Tyr Met Asp Arg 
1285 1290 1295 

He Phe Thr Val He Phe Phe Leu Glu Met Leu He Lys Trp Leu Ala 
1300 1305 1310 

Leu Gly Phe Lys Val Tyr Leu Thr Asn Ala Trp Cys Trp Leu Asp Phe 
1315 1320 1325 

Val He Val Met Val Ser Leu He Asn Phe Val Ala Ser Leu Val Gly 
1330 1335 1340 

Ala Gly Gly He Gin Ala Phe Lys Thr Met Arg Thr Leu Arg Ala Leu 
1345 1350 1355 1360 

Arg Pro Leu Arg Ala Met Ser Arg Met Gin Gly Met Arg Val Val Val 
1365 1370 1375 
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Asn Ala Leu Val Gin Ala He Pro Ser He Phe Asn Val Leu Leu Val 
1380 1385 1390 

Cys Leu lie Phe Trp Leu He Phe Ala He Met Gly Val Gin Leu Phe 
1395 1400 1405 

Ala Gly Lys Tyr Phe Lys Cys Glu Asp Met Asn Gly Thr Lys Leu Ser 
1410 1415 1420 

His Glu He He Pro Asn Arg Asn Ala Cys Glu Ser Glu Asn Tyr Thr 
1425 1430 1435 1440 

Trp Val Asn Ser Ala Met Asn Phe Asp His Val Gly Asn Ala Tyr Leu 
1445 1450 1455 

Cys Leu Phe Gin Val Ala Thr Phe Lye Gly Trp He Gin He Met Asn 
1460 1465 1470 

Asp Ala He Asp Ser Arg Glu Val Asp Lys Gin Pro He Arg Glu Thr 
1475 1480 1485 

Asn He Tyr Met Tyr Leu Tyr Phe Val Phe Phe He He Phe Gly Ser 
1490 1495 1500 

Phe Phe Thr Leu Asn Leu Phe He Gly Val lie lie Asp Asn Phe Asn 
1505 1510 1515 1520 

Glu Gin Lys Lys Lys Ala Gly Gly Ser Leu Glu Met Phe Met Thr Glu 
1525 1530 1535 



Asp Gin Lys Lys Tyr Tyr Ser Ala Met Lys Lys Met Gly Ser Lys Lys 
1540 1545 1550 

Pro Leu Lys Ala lie Pro Arg Pro Arg Trp Arg Pro Gin Ala He Val 

1555 1560 1565 

30 phe Glu He Val Thr Asp Lys Lys Phe Asp He He He Met Leu Phe 

1570 1575 1580 

He Gly Leu Asn Met Phe Thr Met Thr Leu Asp Arg Tyr Asp Ala Ser 
1585 1590 1595 1600 

35 Asp Thr Tyr Asn Ala Val Leu Asp Tyr Leu Asn Ala He Phe Val Val 

1605 1610 1615 

He Phe Ser Ser Glu Cys Leu Leu Lys He Phe Ala Leu Arg Tyr His 
1620 1625 1630 

Tyr Phe He Glu Pro Trp Asn Leu Phe Asp Val Val Val Val He Leu 

40 1635 1640 1645 

Ser He Leu Gly Leu Val Leu Ser Asp He He Glu Lys Tyr Phe Val 
1650 1655 1660 

Ser Pro Thr Leu Leu Arg Val Val Arg Val Ala Lys Val Gly Arg Val 
45 1665 1670 1675 1680 

Leu Arg Leu Val Lys Gly Ala Lys Gly He Arg Thr Leu Leu Phe Ala 
1685 1690 1695 



50 



Leu Ala Met Ser Leu Pro Ala Leu Phe Asn He Cys Leu Leu Leu Phe 
1700 1705 1710 

Leu Val Met Phe He Phe Ala He Phe Gly Met Ser Phe Phe Met His 
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1715 1720 1725 

Val Lys Glu Lys Ser Gly lie Asn Asp Val Tyr Asn Phe Lys Thr Phe 
1730 1735 1740 

Gly Gin Ser Met lie Leu Leu Phe Gin Met Ser Thr Ser Ala Gly Trp 
1745 1750 1755 1760 

Asp Gly Val Leu Asp Ala He He Asn Glu Glu Ala Cys Asp Pro Pro 
1765 1770 1775 

Asp Asn Asp Lys Gly Tyr Pro Gly Asn Cys Gly Ser Ala Thr Val Gly 
1780 1785 1790 

He Thr Phe Leu Leu Ser Tyr Leu Val He Ser Phe Leu He Val He 
1795 1800 1805 

Asn Met Tyr He Ala Val He Leu Glu Asn Gly He 
1810 1815 1820 

(2) INFORMATION FOR SEQ ID NO : 9 : 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

-25 -- (ii) MOLECULE TYPE r DNA (genomic) ■ - - ~ ■-— ~ — 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

30 ATGAGCCGCA TGCAGGGCAT GAGGGTACGT ACCACCCTGT GCTGCCGACA ACACCCTATC 60 

GCTCATCCAT CCACCACACA CTTCGCTCCA CACTTCACAT TCACATTTCT ATTTCAACTT 12 0 

CTACGATCAT TTTTTAACAT TTTAAAATTT CCAACGTRCC AGCCGTACTM GGGCTCCTTT 180 

TTTCGATATT TCTGCATSAA TCACCGGATC AAAATTTGTT TTTAATAGTT AATTTGGACA 240 

GTTATCCGAT TCATTGGCAG TAGTCGATTG AAGTAATTAT TAGTGAATCA TTTTGAAGTG 3 00 

GTCGGTGGCA CCCCTGAATG GCTTAGTATC ATCACTGTTC GTCATAAACC TCTTTTAGAA 3 60 

AGGGT CAATG GGATTTATTG TGGAGAGATA TTYRTCCATG TTTTGGTCTC TTTTCTATTG 420 

GTCTTATTAT TAGCTAGATT AGACTTTTGT AATTACTTAG TTATTTGGAA TGCTAATTTA 480 

TATTCTGCAC CTTAGATTTT TTCTTCTTGT ATCTTCATCG A 521 
(2) INFORMATION FOR SEQ ID NO: 10: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 568 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 





GCTAACTGCT 


ACATAGTTAC 


TGCACAGTAT 


TAATGACATT 


AACGTCCTTA 


TATCCCAACT 


60 


5 


AATAATGCGC 


CCACTAACAA 


ATGCACGCCA 


TTGATATAAG 


AAAGGAGACG 


TATCAGTACT 


120 




TCCAATATAT 


CCTTCGTGAC 


CAGTGTAGTA 


ATACGTACGT 


ATGTGACAGG 


TGGTGGTAAA 


180 




CGCTCTCGTG 


CAAGCGATCC 


CGTCCATCTT 


CAACGTGTTG 


TTGGTGTGTC 


TTATCTTCTG 


240 


10 


GCTGATCTTC 


GCCATCATGG 


GAGTACAACT 


GTTCGCTGGC 


AAATATTTCA 


AGGTATTAAT 


300 




TTATTAACAT 


AACAAAAAAA 


TATTTCAATT 


CGTAAAATCT 


TATTAGTGTG 


TTCAAAATTT 


360 




CTAACATGTT 


TTTCTTTGTT 


CTGTTCTAGT 


GCGTCGACCT 


CAACCACACG 


ACGTTGAGCC 


420 


75 


ACGAAATCAT 


CCCAGACCGG 


AATGCGTGCA 


TCTTAGAGAA 


CTACACCTGG 


GAGAACTCAC 


4B0 




CGATGAACTT 


TGACCATGTC 


GGCAAGGCGT 


ATCTCTGCCT 


GTTCCAAGTG 


GCCACCTTCA 


540 




AGGGATGGAT 


ACAGATCATG 


AACGACGC 
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Claims 

1. An isolated nucleic acid fragment comprising a nucleic acid sequence encoding a non-dipteran sodium 
-25 channel; or portion thereof. - - — - — 

2. The fragment of Claim 1 in which the channel is either lepidopteran, coieopteran or homopteran. 

3. The fragment of Claim 2 which is lepidopteran. 

30 

4. The fragment of Claim 3 which is derived from Heliothis , Helicoverpa or Spodoptera . 

5. The fragment of Claim 4 which is derived from Heliothis virescens , Heliothis armigera , or Helicoverpa 
zea . 

35 

6. The fragment of Claim 1 which hybridizes with a nucleic acid sequence depicted in Figure 1 under 
medium or high stringency conditions. 

7. The fragment of Claim 1 which comprises all or a portion of the sequence depicted in Figure 1. 

40 

8. The fragment of Claim 1 which is capable of being used as a probe to detect RFLPs in an insect 
population comprising both pyrethroid sensitive and pyrethroid resistant individuals. 

9. The fragment of Claim 1 which is detectably labelled. 

45 

10. An isolated nucleic acid fragment deposited with the American Type Culture Collection under Acces- 
sion No. 75334. 

11. A vector comprising the fragment of Claim 1 . 

50 

12. A host cell comprising the vector of Claim 11. 
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